BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search Paper • 2601.11037 • Published 4 days ago • 12 • 3
AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems Paper • 2601.11354 • Published 4 days ago • 2 • 3
When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs Paper • 2601.11000 • Published 4 days ago • 23 • 4
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts Paper • 2601.11044 • Published 4 days ago • 25 • 3
ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models Paper • 2601.11404 • Published 4 days ago • 20 • 3
FrankenMotion: Part-level Human Motion Generation and Composition Paper • 2601.10909 • Published 4 days ago • 13 • 3
PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records Paper • 2601.09636 • Published 6 days ago • 5 • 4
Future Optical Flow Prediction Improves Robot Control & Video Generation Paper • 2601.10781 • Published 5 days ago • 10 • 2
PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models Paper • 2601.11087 • Published 4 days ago • 7 • 3
Entropy Sentinel: Continuous LLM Accuracy Monitoring from Decoding Entropy Traces in STEM Paper • 2601.09001 • Published 7 days ago • 12 • 3
More Images, More Problems? A Controlled Analysis of VLM Failure Modes Paper • 2601.07812 • Published 8 days ago • 3 • 3
What Matters in Data Curation for Multimodal Reasoning? Insights from the DCVLR Challenge Paper • 2601.10922 • Published 4 days ago • 1 • 2
ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection Paper • 2601.09195 • Published 6 days ago • 11 • 6
The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents Paper • 2601.11496 • Published 4 days ago • 41 • 3
PhyRPR: Training-Free Physics-Constrained Video Generation Paper • 2601.09255 • Published 6 days ago • 2 • 3
Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text Paper • 2601.10355 • Published 5 days ago • 33 • 4
Language of Thought Shapes Output Diversity in Large Language Models Paper • 2601.11227 • Published 4 days ago • 1 • 3