Running on CPU Upgrade • Featured • The Smol Training Playbook • 2.75k • The secrets to building world-class LLMs
Running • The Ultra-Scale Playbook • 3.61k • The ultimate guide to training LLMs on large GPU clusters
deepseek-ai/DeepSeek-R1-Distill-Llama-70B • Text Generation • 71B • Updated Feb 24 • 96.8k • 734
Running on CPU Upgrade • Featured • Model Memory Utility • 992 • Calculate vRAM needed for model training and inference
BAAI/bge-reranker-v2-minicpm-layerwise • Text Classification • 3B • Updated Mar 19, 2024 • 1.75k • 63