Reward Inside the Model: A Lightweight Hidden-State Reward Model for LLM's Best-of-N sampling Paper • 2505.12225 • Published May 18 • 4
Model-Based Differentially Private Knowledge Transfer for Large Language Models Paper • 2410.10481 • Published Oct 14, 2024 • 1
Calibrating Reasoning in Language Models with Internal Consistency Paper • 2405.18711 • Published May 29, 2024 • 6