Asankhaya Sharma's picture

In a Training Loop 🔄

Asankhaya Sharma

codelion

·

http://asankhaya.github.io/

AI & ML interests

Creator of OptiLLM, OpenEvolve, Adaptive Classifier, and Ellora. Pioneering a new category in AI infrastructure: inference-time compute for LLMs.

Recent Activity

updated a Space 2 days ago

codelion/pts-visualizer

reacted to their post with 👍 2 days ago

Introducing PTS Visualizer - an interactive tool for exploring how language models reason! Visualize pivotal tokens, thought anchors, and reasoning circuits. See which tokens and sentences significantly impact success probability, explore embedding clusters, and trace reasoning step-by-step. Try it: https://huggingface.co/spaces/codelion/pts-visualizer Explore PTS datasets: - Qwen3-0.6B: https://huggingface.co/datasets/codelion/Qwen3-0.6B-pts - DeepSeek-R1: https://huggingface.co/datasets/codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts Or upload your own JSONL files! GitHub: https://github.com/codelion/pts

reacted to their post with 🤗 2 days ago

Introducing PTS Visualizer - an interactive tool for exploring how language models reason! Visualize pivotal tokens, thought anchors, and reasoning circuits. See which tokens and sentences significantly impact success probability, explore embedding clusters, and trace reasoning step-by-step. Try it: https://huggingface.co/spaces/codelion/pts-visualizer Explore PTS datasets: - Qwen3-0.6B: https://huggingface.co/datasets/codelion/Qwen3-0.6B-pts - DeepSeek-R1: https://huggingface.co/datasets/codelion/DeepSeek-R1-Distill-Qwen-1.5B-pts Or upload your own JSONL files! GitHub: https://github.com/codelion/pts

View all activity

Organizations

codelion 's models 27

codelion/gpt-2-70m

Text Generation • 64.1M • Updated Nov 2 • 695 • 17

codelion/Qwen3-4B-execution-world-model-lora

Text Generation • Updated Oct 20 • 36 • 3

codelion/Qwen2.5-Coder-0.5B-Instruct-security-grpo-lora

Text Generation • Updated Aug 2 • 4

codelion/qwen2-5-coder-0-5b-instruct-progressive-2000k-lora

Text Generation • Updated Jul 20 • 5

codelion/Llama-3.2-1B-Instruct-tool-calling-lora

Text Generation • Updated Jul 18 • 101 • 4

codelion/gemma-3-1b-it-reasoning-grpo-lora

Text Generation • Updated Jul 18 • 15 • 5

codelion/Qwen3-0.6B-ICM-DPO

Text Generation • 0.6B • Updated Jul 18 • 11

codelion/gemma-3-1b-it-ICM-DPO

Text Generation • 1.0B • Updated Jul 18 • 13

codelion/gemma-3-1b-it-ICM-DPO-mlx-fp16

Text Generation • 1B • Updated Jul 17 • 22

codelion/Qwen3-0.6B-ICM-DPO-mlx-fp16

Text Generation • 0.6B • Updated Jul 17 • 28 • 2

codelion/Qwen3-0.6B-accuracy-recovery-lora

Text Generation • Updated Jul 13 • 55 • 4

codelion/Qwen3-0.6B-GRPO-mlx-fp16

Text Generation • 0.6B • Updated Jul 11 • 7

codelion/Qwen3-0.6B-GRPO

Text Generation • 0.6B • Updated Jul 11 • 5

codelion/DeepSeek-R1-Distill-Qwen-1.5B-PTS-DPO

Text Generation • 2B • Updated May 13 • 11 • 2

codelion/Qwen3-0.6B-PTS-DPO

Text Generation • 0.6B • Updated May 12 • 20 • 1

codelion/Qwen3-0.6B-PTS-DPO-LoRA

Updated May 7 • 1

codelion/optillm-bert-uncased

Updated Feb 16 • 39 • 5

codelion/optillm-modernbert-large

Updated Feb 16 • 25 • 9

codelion/Llama-3.3-70B-o1

Text Generation • 71B • Updated Jan 21 • 74 • • 2

codelion/Llama-3.3-70B-o1-gguf

71B • Updated Jan 20 • 94 • 1

codelion/Llama-3.3-70B-o1-lora

Updated Jan 20 • 2

codelion/Llama-3.2-3B-o1

3B • Updated Jan 12 • 69 • 5

codelion/Llama-3.2-3B-o1-lora

Updated Jan 12 • 4

codelion/MathCoT

8B • Updated Nov 26, 2024 • 49 • 2

codelion/scorelora

Updated Oct 15, 2024 • 6 • 3

codelion/public-domain-mickey-mouse

Text-to-Image • Updated Jan 5, 2024 • 14 • • 2

codelion/whisper-age-estimator

Automatic Speech Recognition • 72.6M • Updated Sep 10, 2023 • 29 • 3