SkeptiSTEM-4B (stageR1)

This repo contains the LoRA adapter for SkeptiSTEM-4B, fine-tuned from unsloth/Qwen3-4B-Base.

Stage: R1, supervised fine-tuning (SFT) on a STEM mixture of math, science, and coding data.
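Because this repo ships only the adapter weights, they have to be attached to the base model before use. The following is a minimal sketch of one way to do that, assuming the `transformers` and `peft` packages are installed and that the tokenizer is taken from the base model. The function names `load_model` and `generate` are illustrative helpers, not part of this repo, and nothing here specifies a prompt format for the model.

```python
# Repo IDs taken from this model card.
BASE_MODEL_ID = "unsloth/Qwen3-4B-Base"
ADAPTER_ID = "HallD/SkeptiSTEM-4B-stageR1-lora"


def load_model(base_id=BASE_MODEL_ID, adapter_id=ADAPTER_ID):
    """Load the base model and attach the LoRA adapter (downloads weights)."""
    # Imported lazily so that defining these helpers does not require
    # the heavy dependencies to be installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    # Assumption: the tokenizer is loaded from the base model repo.
    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype="auto")

    # Wrap the base model with the adapter weights from this repo.
    model = PeftModel.from_pretrained(base, adapter_id)
    model.eval()
    return tokenizer, model


def generate(tokenizer, model, prompt, max_new_tokens=128):
    """Run plain text completion on a prompt and return the decoded text."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

A typical call would be `tokenizer, model = load_model()` followed by `generate(tokenizer, model, "...")` with a prompt of your choice. Loading is kept separate from generation so the weights are downloaded and loaded only once.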


Model tree for HallD/SkeptiSTEM-4B-stageR1-lora

Base model: Qwen/Qwen3-4B-Base
Finetuned: unsloth/Qwen3-4B-Base
Adapter (21): this model