When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration Paper • 2506.05579 • Published Jun 5, 2025 • 4
IMPersona: Evaluating Individual Level LM Impersonation Paper • 2504.04332 • Published Apr 6, 2025 • 2
view article Article Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs +5 Apr 16, 2024 • 16