Running on CPU Upgrade Featured 2.68k The Smol Training Playbook π 2.68k The secrets to building world-class LLMs
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper β’ 2501.04519 β’ Published Jan 8 β’ 286 β’ 44
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper β’ 2501.04519 β’ Published Jan 8 β’ 286 β’ 44
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper β’ 2501.04519 β’ Published Jan 8 β’ 286 β’ 44
Running Featured 1.23k FineWeb: decanting the web for the finest text data at scale π· 1.23k Generate high-quality text data for LLMs using FineWeb
Running on Zero Featured 5.34k IllusionDiffusion π 5.34k Generate stunning high quality illusion artwork
ironbar/dqn-SpaceInvadersNoFrameskip-v4-1M-steps Reinforcement Learning β’ Updated Jun 12, 2022 β’ 5