-
-
-
-
-
-
Inference Providers
Active filters:
quark
Quark-NPU-Workshop/po-phi3-mini-4k-ins
0.6B
•
Updated
EmbeddedLLM/Qwen3-VL-235B-A22B-Instruct-FP8-PTPC-Quark
236B
•
Updated
•
2
amd/granite-4.0-h-small-fp8
32B
•
Updated
•
109
haoyang-amd/Qwen1.5-MoE-A2.7B-ptpc
14B
•
Updated
•
3
amd/Qwen3-30B-A3B-Thinking-2507-ptpc
31B
•
Updated
•
13
•
1
amd/gpt-oss-20b-MoE-Quant-W-MXFP4-A-FP8-KV-FP8
11B
•
Updated
•
148
amd/gpt-oss-20b-WFP8-AFP8-KVFP8
21B
•
Updated
•
140
amd/Qwen3-VL-235B-A22B-Instruct-ptpc
236B
•
Updated
•
3
amd/DeepSeek-R1-0528-ptpc
671B
•
Updated
•
4
amd/DeepSeek-R1-0528-mtp-ptpc
684B
•
Updated
•
22
amd/DeepSeek-V3.2-mtp-ptpc
686B
•
Updated
•
7
amd/Kimi-K2-Thinking-W4A8
Text Generation
•
Updated
•
292
pzhang56/Qwen2.5-1.5B-AMD-FP8
2B
•
Updated
•
1
amd/DeepSeek-V3.2-Speciale-mtp-ptpc
686B
•
Updated
•
10
amd/Qwen3-235B-A22B-Thinking-2507-ptpc
235B
•
Updated
•
900
amd/Kimi-K2-Thinking-MXFP4
551B
•
Updated
•
446
amd/Qwen3-235B-A22B-Instruct-2507-MXFP4
Text Generation
•
118B
•
Updated
•
1.41k
•
2
185B
•
Updated
•
199
•
1
amd-quark/internal-testing-qwen3_0.6b-mxfp4-hadamard
0.5B
•
Updated
•
32
amd-quark/internal-testing-qwen3_0.6b-fp8-hadamard
0.8B
•
Updated
•
31
amd-quark/internal-testing-qwen3_0.6b-int8-hadamard
0.8B
•
Updated
•
19
amd-quark/internal-testing-qwen3_0.6b-mxfp4-tuned-orthogonal
0.5B
•
Updated
•
26
amd-quark/internal-testing-qwen3_0.6b-fp8-tuned-orthogonal
0.8B
•
Updated
•
27
EmbeddedLLM/Qwen3-30B-A3B-Instruct-2507-MXFP4
16B
•
Updated
•
56
377B
•
Updated
•
288
amd/Qwen3-Coder-480B-A35B-Instruct-MXFP4
Text Generation
•
246B
•
Updated
•
1
amd/Kimi-K2-Instruct-0905-MXFP4
551B
•
Updated
•
824
•
1
Text Generation
•
116B
•
Updated
•
380
•
1
2B
•
Updated
•
75
551B
•
Updated
•
55