Inference Providers
Active filters: fp4
Text Generation
• 435B • Updated • 3.42k
• 9
Kbenkhaled/Qwen3.5-27B-NVFP4
Image-Text-to-Text
• 17B • Updated • 33.3k
• 27
nvidia/Qwen3.5-397B-A17B-NVFP4
Text Generation
• Updated • 141k
• 73
txn545/Qwen3.5-122B-A10B-NVFP4
Text Generation
• 64B • Updated • 259k
• 19
lyf/Huihui-Qwen3.5-27B-abliterated-NVFP4
Image-Text-to-Text
• 17B • Updated • 1.27k
• 4
cybermotaz/nemotron3-nano-nvfp4-w4a16
Text Generation
• 18B • Updated • 15.1k
• 14
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-NVFP4
Text Generation
• 229B • Updated • 696
• 5
berkerdooo/Qwen3.5-27B-NVFP4
Image-Text-to-Text
• 17B • Updated • 1.36k
• 2
Text Generation
• 19B • Updated • 35.3k
• 7
RedHatAI/Llama-4-Scout-17B-16E-Instruct-NVFP4
Text Generation
• 64B • Updated • 10.7k
• 1
nm-testing/DeepSeek-R1-Distill-Qwen-32B-NVFP4
Text Generation
• 19B • Updated • 388
• 2
RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4
Text Generation
• 133B • Updated • 12.4k
• 15
ussoewwin/Hybrid-Sensitivity-Weighted-Quantization-SDXL-fp8e4m3
Text-to-Image
• Updated • 6
Firworks/Step-3.5-Flash-nvfp4
111B • Updated • 112
• 1
tacos4me/Step-3.5-Flash-NVFP4
Text Generation
• 111B • Updated • 2k
• 9
txn545/Qwen3.5-35B-A3B-NVFP4
Text Generation
• Updated • 25.9k
• 4
Kbenkhaled/Qwen3.5-35B-A3B-NVFP4
Image-Text-to-Text
• Updated • 28.4k
• 8
Image-Text-to-Text
• 8B • Updated • 1.18k
• 1
Text-to-Image
• Updated • 188
• 1
rene98c/Qwen3.5-397B-A17B-REAP-28-NVFP4
Text Generation
• Updated • 720
• 3
lukealonso/Qwen3.5-397B-A17B-NVFP4
Text Generation
• Updated • 8.46k
• 4
mengqin1/RedidreamNSFWI1-bnb-4bit
Updated
Text Generation
• 19B • Updated • 3
qingcheng-ai/Qwen3-32B-fp4
Text Generation
• 19B • Updated • 64
• 4
qingcheng-ai/Qwen3-8B-fp4
Text Generation
• 5B • Updated • 15
• 1
RedHatAI/Qwen3-30B-A3B-NVFP4
Text Generation
• 17B • Updated • 14.2k
• 2
RedHatAI/Llama-3.1-70B-Instruct-NVFP4
Text Generation
• 41B • Updated • 1.85k
RedHatAI/Llama-3.1-70B-Instruct-NVFP4A16
Text Generation
• 41B • Updated • 2
RedHatAI/Qwen3-32B-NVFP4A16
Text Generation
• 19B • Updated • 822
• 2
nvidia/Qwen3-235B-A22B-NVFP4
Text Generation
• 133B • Updated • 6.93k
• 14