Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

524

Full-text search

Active filters: quantization

sandeshrajx/gpt-oss-20b-reap-0.4-mxfp4-gguf

Text Generation • 14B • Updated 4 days ago • 120

jgerster0/Apertus-8B-Instruct-2509-W8A16

3B • Updated 4 days ago • 1.5k

coughmedicine/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated-W4A16

Updated 3 days ago • 16

zahraase1im/mistral-7b-awq-4bit-rag

7B • Updated 3 days ago • 17

mradermacher/Fairy2i-W2-GGUF

Text Generation • 7B • Updated 2 days ago • 173

mradermacher/Fairy2i-W2-i1-GGUF

Text Generation • 7B • Updated 2 days ago • 546

trithemius/Velvet-14B-nvfp4

8B • Updated 1 day ago • 9

jgerster0/Apertus-8B-Instruct-2509-W8A16-CALIBRATED

3B • Updated 1 day ago • 1.03k

Tfloow/Llama-3.2-1B-adpq-4bit-sim-16workers

1B • Updated 1 day ago • 14

tsqn/Z-Image-Turbo_fp8_comfyui

Text-to-Image • Updated about 23 hours ago • 5

Tfloow/Llama-3.2-1B-adpq-4bit-sim-0.02

1B • Updated about 16 hours ago

ogiwrghs/Phi-3-medium-128k-instruct-GGUF

Text Generation • 14B • Updated about 9 hours ago

avtc/GLM-4.5-Air-GPTQMODEL-W4A16

Text Generation • 121B • Updated about 7 hours ago

jgerster0/Apertus-8B-Instruct-2509-LOGEQ-FP8_dynamic

8B • Updated about 10 hours ago