Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

267

Full-text search

Active filters: nvfp4

Sehyo/Qwen3.5-122B-A10B-NVFP4

Image-Text-to-Text • 71B • Updated 14 days ago • 188k • 43

Sehyo/Qwen3.5-35B-A3B-NVFP4

Updated 15 days ago • 62k • 27

Kbenkhaled/Qwen3.5-27B-NVFP4

Image-Text-to-Text • 17B • Updated 17 days ago • 22k • 22

DreamFast/gemma-3-12b-it-heretic-v2

Text Generation • 12B • Updated 6 days ago • 2.32k • 8

mconcat/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-NVFP4

Text Generation • 22B • Updated 6 days ago • 2.37k • 6

drbaph/FireRed-Image-Edit-1.1_ComfyUI_Quants

Image-to-Image • Updated 6 days ago • 354 • 6

AxionML/Qwen3.5-27B-NVFP4

Image-Text-to-Text • 17B • Updated 14 days ago • 2.45k • 4

DreamFast/qwen3-4b-heretic

Text Generation • 4B • Updated 6 days ago • 868 • 4

lukealonso/Qwen3.5-397B-A17B-NVFP4

Text Generation • Updated 5 days ago • 4.35k • 4

Sikaworld1990/gemma-3-12b-qat-abliterated-sikaworld-fp4-ltx2

Feature Extraction • Updated 5 days ago • 4

GadflyII/Qwen3-Coder-Next-NVFP4

Text Generation • Updated Feb 4 • 674k • 37

tacos4me/Step-3.5-Flash-NVFP4

Text Generation • 111B • Updated 23 days ago • 2.77k • 9

nota-ai/Solar-Open-100B-NotaMoEQuant-NVFP4

Text Generation • 59B • Updated 5 days ago • 176 • 3

ApacheOne/FLUX.2-klein-9b-kv-nvfp4_mixed

Updated 4 days ago • 86 • 3

nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4

Text Generation • Updated Feb 9 • 129k • 52

scottgl/Qwen3.5-122B-A10B-MTP-NVFP4

Updated 15 days ago • 725 • 3

saricles/MiniMax-M2.5-REAP-172B-A10B-NVFP4-GB10

Text Generation • 98B • Updated 16 days ago • 750 • 4

saricles/MiniMax-M2.5-REAP-139B-A10B-NVFP4-GB10

Text Generation • 79B • Updated 15 days ago • 431 • 3

AxionML/Qwen3.5-122B-A10B-NVFP4

Image-Text-to-Text • 62B • Updated 14 days ago • 1.37k • 3

rene98c/Qwen3.5-397B-A17B-REAP-28-NVFP4

Text Generation • Updated 5 days ago • 545 • 2

nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4

Text Generation • Updated Feb 9 • 71.8k • 34

cybermotaz/Qwen3-Omni-30B-A3B-Instruct-NVFP4

Text Generation • Updated Dec 25, 2025 • 5

nvidia/DeepSeek-V3.2-NVFP4

Text Generation • 394B • Updated Jan 21 • 26.2k • 8

nvidia/Qwen3-235B-A22B-Thinking-2507-NVFP4

Text Generation • Updated Jan 30 • 1.14k • 6

nvidia/Qwen3-235B-A22B-Instruct-2507-NVFP4

Text Generation • 120B • Updated Jan 30 • 4.32k • 6

Firworks/atom-27b-nvfp4

18B • Updated Jan 8 • 8 • 2

Firworks/Aletheia-12B-nvfp4

7B • Updated Jan 20 • 7 • 1

Firworks/SERA-32B-GA-nvfp4

19B • Updated Feb 1 • 4 • 1

Firworks/Step-3.5-Flash-nvfp4

111B • Updated Feb 9 • 335 • 1

alphakek/GLM-4.7-Flash-heretic-NVFP4

Text Generation • 17B • Updated 27 days ago • 317 • 2