interesting papers
updated
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
Paper
•
2502.02737
•
Published
•
252
A Survey of Context Engineering for Large Language Models
Paper
•
2507.13334
•
Published
•
259
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
Paper
•
2501.12948
•
Published
•
432
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper
•
2501.08313
•
Published
•
300
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model
Post-training
Paper
•
2501.17161
•
Published
•
123
The GAN is dead; long live the GAN! A Modern GAN Baseline
Paper
•
2501.05441
•
Published
•
95
Distiller: A Systematic Study of Model Distillation Methods in Natural
Language Processing
Paper
•
2109.11105
•
Published
A Survey of Reinforcement Learning for Large Reasoning Models
Paper
•
2509.08827
•
Published
•
190
LongLive: Real-time Interactive Long Video Generation
Paper
•
2509.22622
•
Published
•
184
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP
Use
Paper
•
2509.24002
•
Published
•
174
Paper
•
2508.10104
•
Published
•
291