Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Feng's picture
4 8

Feng

Yunzhen
y8phi's profile picture
·
https://fengyzpku.github.io/
  • fengyzpku

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 3 months ago

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published Oct 15, 2025 • 32

Don't Waste Mistakes: Leveraging Negative RL-Groups via Confidence Reweighting

Paper • 2510.08696 • Published Oct 9, 2025 • 15
upvoted 3 papers 4 months ago

Rethinking Thinking Tokens: LLMs as Improvement Operators

Paper • 2510.01123 • Published Oct 1, 2025 • 6

Soft Tokens, Hard Truths

Paper • 2509.19170 • Published Sep 23, 2025 • 16

What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT

Paper • 2509.19284 • Published Sep 23, 2025 • 23
upvoted 2 papers 12 months ago

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Paper • 2502.05163 • Published Feb 7, 2025 • 22

PILAF: Optimal Human Preference Sampling for Reward Modeling

Paper • 2502.04270 • Published Feb 6, 2025 • 12
upvoted a paper almost 2 years ago

Teaching Large Language Models to Reason with Reinforcement Learning

Paper • 2403.04642 • Published Mar 7, 2024 • 49
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs