Telugu-LLM-Labs/telugu_teknium_GPTeacher_general_instruct_filtered_romanized Viewer • Updated Jan 30, 2024 • 43.6k • 35 • 12
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback Paper • 2503.22230 • Published Mar 28 • 45