FeedbackLogs: Recording and Incorporating Stakeholder Feedback into Machine Learning Pipelines Paper • 2307.15475 • Published Jul 28, 2023
Large Language Models Must Be Taught to Know What They Don't Know Paper • 2406.08391 • Published Jun 12, 2024 • 1
Can Large Language Models Understand Symbolic Graphics Programs? Paper • 2408.08313 • Published Aug 15, 2024 • 7
General Scales Unlock AI Evaluation with Explanatory and Predictive Power Paper • 2503.06378 • Published Mar 9 • 1
Modeling Open-World Cognition as On-Demand Synthesis of Probabilistic Models Paper • 2507.12547 • Published Jul 16
Evaluating Language Models for Mathematics through Interactions Paper • 2306.01694 • Published Jun 2, 2023 • 2