arxiv:2508.03990
Bill Avan
BillAvan
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 24 hours ago
ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents
upvoted
a
paper
4 months ago
Who's Your Judge? On the Detectability of LLM-Generated Judgments
authored
a paper
6 months ago
Are Today's LLMs Ready to Explain Well-Being Concepts?
Organizations
None yet