·
AI & ML interests
None yet
Organizations
flyingbugs/bi_unlearn_wmdp
Text Generation
•
7B
•
Updated
•
3
flyingbugs/OpenR1-Qwen-math-7B-SFT-mid-only
Text Generation
•
8B
•
Updated
•
1
flyingbugs/qwen-65-open-r1
Text Generation
•
8B
•
Updated
flyingbugs/GeneralThought-195K-65-qwen7b
Text Generation
•
8B
•
Updated
•
3
flyingbugs/limo-solutions-deepseek-qwen-7b
Text Generation
•
8B
•
Updated
flyingbugs/deepseek-distilled-qwen-7b-rl
Text Generation
•
8B
•
Updated
•
2
flyingbugs/Qwen2.5-Math-7B-limo-32b
Text Generation
•
8B
•
Updated
•
1
flyingbugs/Qwen2.5-math-1.5B-Open-R1-Distill-eos-new
Text Generation
•
2B
•
Updated
flyingbugs/Qwen2.5-1.5B-Open-R1-Distill-eos-epic-new
Text Generation
•
2B
•
Updated
flyingbugs/Qwen2.5-math-1.5B-Open-R1-Distill-eos
Text Generation
•
2B
•
Updated
•
2
flyingbugs/Qwen2.5-1.5B-Open-R1-Distill-eos-epic
Text Generation
•
2B
•
Updated
flyingbugs/OpenR1-Qwen-7B-SFT-65
Text Generation
•
333k
•
Updated
•
5
flyingbugs/OlympicCoder-7B
333k
•
Updated
flyingbugs/Qwen2.5-1.5B-Open-R1-Distill-eos
Text Generation
•
2B
•
Updated
•
1
flyingbugs/granite3.3-8b-reinforce_plus-math_different_reward_global_step60_hf
Text Generation
•
8B
•
Updated
flyingbugs/granite3.3-8b-math-pku-rlhf-reinforce-plus
Text Generation
•
8B
•
Updated
flyingbugs/granite_pku_saferlhf_reinforce_plus_plus
Text Generation
•
8B
•
Updated
•
1
flyingbugs/granite_star_1_limo_1e5
Text Generation
•
8B
•
Updated
flyingbugs/granite_star_1_limo
Text Generation
•
8B
•
Updated
flyingbugs/granite_star_1
Text Generation
•
8B
•
Updated
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-add-aime
Text Generation
•
8B
•
Updated
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-pruned-keep-0.5-end-start-0.5-add-aime
Text Generation
•
8B
•
Updated
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-random-perturbation-head
Text Generation
•
8B
•
Updated
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-random-perturbation-full
Text Generation
•
8B
•
Updated
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-random-perturbation-tail
Text Generation
•
8B
•
Updated
•
4
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-keep-0.5-end-start-0.5-random-perturbation
Text Generation
•
8B
•
Updated
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-pruned-keep-0.75-end-start-0.0
Text Generation
•
8B
•
Updated
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-random-perturbation-middle
Text Generation
•
8B
•
Updated
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-pruned-think-mid
Text Generation
•
8B
•
Updated
•
3
flyingbugs/Qwen2.5-Math-7B-s1k
Text Generation
•
8B
•
Updated