Sweaterdog's Collections

Smol-reason

updated Aug 7, 2025

My first use of GRPO fine-tuning techniques. Lessons learned from this model will be applied to future Andy models.


  • Sweaterdog/Smol-reason2.1

    3B • Updated Apr 16, 2025 • 30

  • Sweaterdog/Smol-reason2

    3B • Updated Apr 4, 2025 • 32

  • Sweaterdog/Smol-Reason

    3B • Updated Apr 4, 2025 • 47 • 1

  • Sweaterdog/Smol-reason-LoRA

    Updated Apr 4, 2025

  • Sweaterdog/Andy-4-preview-reasoning

    Viewer • Updated Mar 30, 2025 • 13.4k • 7

    Note: Datasets for the Smol-reason family


  • openai/gsm8k

    Benchmark • Updated Dec 20, 2025 • 17.6k • 488k • 1.14k