This repo contains the LoRA adapter for SkeptiSTEM-4B, fine-tuned from unsloth/Qwen3-4B-Base.
Stage: R1 STEM SFT (math + science + coding mixture).
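As a rough illustration of how an adapter like this is typically used, the sketch below loads the base model and applies the LoRA weights on top with the `peft` library. It assumes `transformers` (a release with Qwen3 support) and `peft` are installed. The helper name `load_skeptistem` and the adapter repo ID passed to it are hypothetical placeholders, since this card does not state the adapter's Hub ID; only the base model ID comes from the text above.

```python
# Base model ID as stated in this card.
BASE_MODEL_ID = "unsloth/Qwen3-4B-Base"


def load_skeptistem(adapter_id: str, base_id: str = BASE_MODEL_ID):
    """Load the base model and attach a LoRA adapter to it.

    `adapter_id` is a Hub repo ID or local path of the adapter; the
    caller must supply it (it is not given in this card).
    """
    # Imported lazily so the module can be inspected without the heavy deps.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)

    # Wrap the base model with the LoRA weights stored in the adapter repo.
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()  # inference mode: disables dropout
    return model, tokenizer
```

Calling `load_skeptistem("<adapter-repo-id>")` with the actual ID of this repo should return the adapted model and its tokenizer. Because the base is a base (non-chat) checkpoint, whether a chat template applies depends on how the SFT data was formatted, which this card does not specify.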
Base model