- alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-400
  2B • Updated • 10
- alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-1000
  2B • Updated • 2
- alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-200
  2B • Updated • 1
- alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-50
  2B • Updated • 3
alphaXiv PRO
alphaXiv
Recent Activity
updated a collection about 10 hours ago: spurious-rewards