Leon Tsou

xxrjun

AI & ML interests

None yet

Recent Activity

new activity about 2 months ago

nvidia/DeepSeek-R1-0528-NVFP4:What does “AA Ref” mean in NVIDIA model benchmarks?

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.75k

The secrets to building world-class LLMs

liked a model 3 months ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • 685B • Updated May 29 • 384k • 2.39k

liked a model 4 months ago

kernels-community/vllm-flash-attn3

Updated Oct 27 • 35

liked a model 5 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 516k • 12.9k

liked a dataset 7 months ago

GPUMODE/KernelBook

Viewer • Updated Jun 25 • 18.2k • 777 • 45

liked a Space 10 months ago

The Ultra-Scale Playbook

🌌

3.61k

The ultimate guide to training LLM on large GPU Clusters

liked a model 10 months ago

MediaTek-Research/BreezyVoice

Updated Feb 18 • 49

liked 3 models 11 months ago

liked a Space about 1 year ago

Model Memory Utility

🚀

992

Calculate vRAM needed for model training and inference

liked 2 models about 1 year ago

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6 • 1.2M • 1.25k

BAAI/bge-reranker-v2-minicpm-layerwise

Text Classification • 3B • Updated Mar 19, 2024 • 1.75k • 63

liked a dataset over 1 year ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 24.7k • 1.53k

liked a model over 1 year ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27 • 684k • 12.1k

liked a Space over 1 year ago

Calculate Model Flops

🔥

Calculate FLOPs and parameters for transformer models

liked a model over 1 year ago

meta-llama/CodeLlama-7b-Python-hf

Text Generation • 7B • Updated Mar 14, 2024 • 697 • 25

liked 3 datasets over 1 year ago

ise-uiuc/Magicoder-OSS-Instruct-75K

Viewer • Updated Dec 4, 2023 • 75.2k • 1.55k • 157

google-research-datasets/mbpp

Viewer • Updated Jan 4, 2024 • 1.4k • 2.67M • 195

codeparrot/apps

Updated Oct 20, 2022 • 16.1k • 189