Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
98.0
TFLOPS
72
123
264
Asankhaya Sharma
codelion
Follow
evdcush's profile picture
jed-tiotuico's profile picture
Ystar124's profile picture
386 followers
·
21 following
http://asankhaya.github.io/
asankhaya
codelion
asankhaya
AI & ML interests
Creator of OptiLLM, OpenEvolve, Adaptive Classifier, and Ellora. Pioneering a new category in AI infrastructure: inference-time compute for LLMs.
Recent Activity
updated
a Space
2 days ago
codelion/pts-visualizer
reacted
to
their
post
with 👍
2 days ago
Introducing PTS Visualizer - an interactive tool for exploring how language models reason! Visualize pivotal tokens, thought anchors, and reasoning circuits. See which tokens and sentences significantly impact success probability, explore embedding clusters, and trace reasoning step-by-step. Try it: https://huggingface.co/spaces/codelion/pts-visualizer Explore PTS datasets: - Qwen3-0.6B: https://huggingface.co/datasets/codelion/Qwen3-0.6B-pts - DeepSeek-R1: https://huggingface.co/datasets/codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts Or upload your own JSONL files! GitHub: https://github.com/codelion/pts
reacted
to
their
post
with 🤗
2 days ago
Introducing PTS Visualizer - an interactive tool for exploring how language models reason! Visualize pivotal tokens, thought anchors, and reasoning circuits. See which tokens and sentences significantly impact success probability, explore embedding clusters, and trace reasoning step-by-step. Try it: https://huggingface.co/spaces/codelion/pts-visualizer Explore PTS datasets: - Qwen3-0.6B: https://huggingface.co/datasets/codelion/Qwen3-0.6B-pts - DeepSeek-R1: https://huggingface.co/datasets/codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts Or upload your own JSONL files! GitHub: https://github.com/codelion/pts
View all activity
Organizations
codelion
's models
27
Sort: Recently updated
codelion/gpt-2-70m
Text Generation
•
64.1M
•
Updated
Nov 2
•
695
•
17
codelion/Qwen3-4B-execution-world-model-lora
Text Generation
•
Updated
Oct 20
•
36
•
3
codelion/Qwen2.5-Coder-0.5B-Instruct-security-grpo-lora
Text Generation
•
Updated
Aug 2
•
4
codelion/qwen2-5-coder-0-5b-instruct-progressive-2000k-lora
Text Generation
•
Updated
Jul 20
•
5
codelion/Llama-3.2-1B-Instruct-tool-calling-lora
Text Generation
•
Updated
Jul 18
•
101
•
4
codelion/gemma-3-1b-it-reasoning-grpo-lora
Text Generation
•
Updated
Jul 18
•
15
•
5
codelion/Qwen3-0.6B-ICM-DPO
Text Generation
•
0.6B
•
Updated
Jul 18
•
11
codelion/gemma-3-1b-it-ICM-DPO
Text Generation
•
1.0B
•
Updated
Jul 18
•
13
codelion/gemma-3-1b-it-ICM-DPO-mlx-fp16
Text Generation
•
1B
•
Updated
Jul 17
•
22
codelion/Qwen3-0.6B-ICM-DPO-mlx-fp16
Text Generation
•
0.6B
•
Updated
Jul 17
•
28
•
2
codelion/Qwen3-0.6B-accuracy-recovery-lora
Text Generation
•
Updated
Jul 13
•
55
•
4
codelion/Qwen3-0.6B-GRPO-mlx-fp16
Text Generation
•
0.6B
•
Updated
Jul 11
•
7
codelion/Qwen3-0.6B-GRPO
Text Generation
•
0.6B
•
Updated
Jul 11
•
5
codelion/DeepSeek-R1-Distill-Qwen-1.5B-PTS-DPO
Text Generation
•
2B
•
Updated
May 13
•
11
•
2
codelion/Qwen3-0.6B-PTS-DPO
Text Generation
•
0.6B
•
Updated
May 12
•
20
•
1
codelion/Qwen3-0.6B-PTS-DPO-LoRA
Updated
May 7
•
1
codelion/optillm-bert-uncased
Updated
Feb 16
•
39
•
5
codelion/optillm-modernbert-large
Updated
Feb 16
•
25
•
9
codelion/Llama-3.3-70B-o1
Text Generation
•
71B
•
Updated
Jan 21
•
74
•
•
2
codelion/Llama-3.3-70B-o1-gguf
71B
•
Updated
Jan 20
•
94
•
1
codelion/Llama-3.3-70B-o1-lora
Updated
Jan 20
•
2
codelion/Llama-3.2-3B-o1
3B
•
Updated
Jan 12
•
69
•
5
codelion/Llama-3.2-3B-o1-lora
Updated
Jan 12
•
4
codelion/MathCoT
8B
•
Updated
Nov 26, 2024
•
49
•
2
codelion/scorelora
Updated
Oct 15, 2024
•
6
•
3
codelion/public-domain-mickey-mouse
Text-to-Image
•
Updated
Jan 5, 2024
•
14
•
•
2
codelion/whisper-age-estimator
Automatic Speech Recognition
•
72.6M
•
Updated
Sep 10, 2023
•
29
•
3