dataset-lang HuggingFaceFW/fineweb Viewer • Updated Jul 11 • 52.5B • 169k • 2.56k google/smol Viewer • Updated Oct 31 • 798k • 5.65k • 79
Language Models deepseek-ai/DeepSeek-R1 Text Generation • 685B • Updated Mar 27 • 609k • • 12.9k facebook/opt-125m Text Generation • Updated Sep 15, 2023 • 4.34M • 228 meta-llama/Llama-3.3-70B-Instruct Text Generation • 71B • Updated Dec 21, 2024 • 365k • • 2.61k meta-llama/Llama-3.1-8B-Instruct Text Generation • 8B • Updated Sep 25, 2024 • 10.6M • • 5.18k
dataset-math-reasoning bethgelab/CuratedThoughts Viewer • Updated Feb 26 • 222k • 465 • 44 open-r1/OpenR1-Math-220k Viewer • Updated Feb 18 • 450k • 13.3k • 687 facebook/natural_reasoning Viewer • Updated Feb 21 • 1.15M • 1.56k • 546 open-thoughts/OpenThoughts-114k Viewer • Updated Aug 31 • 228k • 111k • 778
old-language-models openai-community/gpt2 Text Generation • 0.1B • Updated Feb 19, 2024 • 7.91M • 3.06k
Code Language Models Models generating code or performing code completion refactai/Refact-1_6B-fim Text Generation • 2B • Updated Nov 9, 2023 • 158k • 141 bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 158k • 213 Kwaipilot/KwaiCoder-DS-V2-Lite-Base Text Generation • 16B • Updated Jan 6 • 67 • 6
dataset-lang HuggingFaceFW/fineweb Viewer • Updated Jul 11 • 52.5B • 169k • 2.56k google/smol Viewer • Updated Oct 31 • 798k • 5.65k • 79
dataset-math-reasoning bethgelab/CuratedThoughts Viewer • Updated Feb 26 • 222k • 465 • 44 open-r1/OpenR1-Math-220k Viewer • Updated Feb 18 • 450k • 13.3k • 687 facebook/natural_reasoning Viewer • Updated Feb 21 • 1.15M • 1.56k • 546 open-thoughts/OpenThoughts-114k Viewer • Updated Aug 31 • 228k • 111k • 778
old-language-models openai-community/gpt2 Text Generation • 0.1B • Updated Feb 19, 2024 • 7.91M • 3.06k
Language Models deepseek-ai/DeepSeek-R1 Text Generation • 685B • Updated Mar 27 • 609k • • 12.9k facebook/opt-125m Text Generation • Updated Sep 15, 2023 • 4.34M • 228 meta-llama/Llama-3.3-70B-Instruct Text Generation • 71B • Updated Dec 21, 2024 • 365k • • 2.61k meta-llama/Llama-3.1-8B-Instruct Text Generation • 8B • Updated Sep 25, 2024 • 10.6M • • 5.18k
Code Language Models Models generating code or performing code completion refactai/Refact-1_6B-fim Text Generation • 2B • Updated Nov 9, 2023 • 158k • 141 bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 158k • 213 Kwaipilot/KwaiCoder-DS-V2-Lite-Base Text Generation • 16B • Updated Jan 6 • 67 • 6