18 34 63

Debasish Dhal

DebasishDhal99

AI & ML interests

None yet

Recent Activity

upvoted an article about 20 hours ago

The Optimal Architecture for Small Language Models

liked a dataset 19 days ago

amd/SAND-Post-Training-Dataset

new activity about 1 month ago

nanonets/Nanonets-OCR2-3B:Is adding a requirements.txt possible for this model?

View all activity

Organizations

upvoted an article about 20 hours ago

Article

The Optimal Architecture for Small Language Models

3 days ago

•

upvoted a paper 3 months ago

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22 • 142

upvoted an article 4 months ago

Article

How to Choose the Best Open Source LLM for Your Project in 2025

Sep 9

•

upvoted a paper 4 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 194

upvoted a collection 4 months ago

EmbeddingGemma

Collection

3 items • Updated Sep 11 • 105

upvoted a paper 4 months ago

Benchmarking Optimizers for Large Language Model Pretraining

Paper • 2509.01440 • Published Sep 1 • 24

upvoted a collection 4 months ago

GLM-4.5

Collection

GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated Aug 11 • 250

upvoted 2 papers 4 months ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 116

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published Aug 21 • 46

upvoted 4 papers 5 months ago

GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing

Paper • 2508.02831 • Published Aug 4 • 11

The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

Paper • 2507.18553 • Published Jul 24 • 40

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

Paper • 2507.15758 • Published Jul 21 • 35

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 259

upvoted an article 6 months ago

Article

Featherless AI on Hugging Face Inference Providers 🔥

Jun 12

•

upvoted 3 papers 6 months ago

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25 • 64

Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23

Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency

Paper • 2506.08343 • Published Jun 10 • 54

upvoted 3 papers 7 months ago

Debasish Dhal

AI & ML interests

Recent Activity

Organizations

DebasishDhal99's activity

The Optimal Architecture for Small Language Models

How to Choose the Best Open Source LLM for Your Project in 2025

Featherless AI on Hugging Face Inference Providers 🔥