RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services Paper • 2511.07070 • Published Nov 10, 2025 • 19
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published Oct 21, 2025 • 111
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science Paper • 2510.16872 • Published Oct 19, 2025 • 106
meituan-longcat/LongCat-Flash-Thinking Text Generation • 562B • Updated Sep 24, 2025 • 46 • 148
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published Jul 1, 2025 • 250
WebDancer: Towards Autonomous Information Seeking Agency Paper • 2505.22648 • Published May 28, 2025 • 33
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge Paper • 2505.10468 • Published May 15, 2025 • 9
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published May 7, 2025 • 65
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published Apr 30, 2025 • 49
deepseek-ai/DeepSeek-Prover-V2-671B Text Generation • 685B • Updated Apr 30, 2025 • 594 • • 815
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published Mar 3, 2025 • 89
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking Paper • 2502.20730 • Published Feb 28, 2025 • 38
Region-Adaptive Sampling for Diffusion Transformers Paper • 2502.10389 • Published Feb 14, 2025 • 53