MindJourney: Test-Time Scaling with World Models for Spatial Reasoning Paper • 2507.12508 • Published Jul 16, 2025 • 26
MIRIX: Multi-Agent Memory System for LLM-Based Agents Paper • 2507.07957 • Published Jul 10, 2025 • 79
From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models Paper • 2506.09930 • Published Jun 11, 2025 • 8
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens Paper • 2506.17218 • Published Jun 20, 2025 • 29
Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering Paper • 2505.23604 • Published May 29, 2025 • 23