The Illusion of Specialization: Unveiling the Domain-Invariant "Standing Committee" in Mixture-of-Experts Models • Paper • 2601.03425 • Published 10 days ago • 15
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization • Paper • 2601.05242 • Published 8 days ago • 191
LTX-2: Efficient Joint Audio-Visual Foundation Model • Paper • 2601.03233 • Published 10 days ago • 121
MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization • Paper • 2601.01554 • Published 12 days ago • 52
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields • Paper • 2601.03252 • Published 10 days ago • 95
SpotEdit: Selective Region Editing in Diffusion Transformers • Paper • 2512.22323 • Published 21 days ago • 38
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss • Paper • 2512.23447 • Published 18 days ago • 94
TimeBill: Time-Budgeted Inference for Large Language Models • Paper • 2512.21859 • Published 22 days ago • 24
ProEdit: Inversion-based Editing From Prompts Done Right • Paper • 2512.22118 • Published 21 days ago • 17
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion • Paper • 2512.17504 • Published 29 days ago • 96
QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models • Paper • 2512.19526 • Published 25 days ago • 11
Scaling Laws for Code: Every Programming Language Matters • Paper • 2512.13472 • Published Dec 15, 2025 • 10
FaithLens: Detecting and Explaining Faithfulness Hallucination • Paper • 2512.20182 • Published 25 days ago • 8
WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion • Paper • 2512.19678 • Published 25 days ago • 29
Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience • Paper • 2512.17260 • Published 29 days ago • 49