Active filters: cuda
ussoewwin/Flash-Attention-2_for_Windows • Updated • 70
dougeeai/llama-cpp-python-wheels
sequelbox/Ministral-3-14B-Reasoning-2512-PlumEsper1.1 • Image-Text-to-Text • 14B • Updated • 18 • 3
mradermacher/Ministral-3-14B-Reasoning-2512-PlumEsper1.1-i1-GGUF • 14B • Updated • 468 • 3
Hellohal2064/vllm-dgx-spark-gb10 • Text Generation • Updated • 1
aydin99/FLUX.2-klein-4B-int8 • Text-to-Image • Updated • 583 • 10
bitresurrector/bitResurrector-v3-High-Performance-Recovery
RESMP-DEV/GLM-4.7-Flash-Trellis-MM • Text Generation • 14B • Updated • 8 • 1
Text Generation • Updated • 9 • 23
CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA • Text Generation • Updated
marcorez8/llama-cpp-python-windows-blackwell-cuda
ValiantLabs/Qwen3-8B-ShiningValiant3 • Text Generation • 8B • Updated • 43 • 3
mradermacher/Qwen3-8B-ShiningValiant3-GGUF • 8B • Updated • 198 • 2
mradermacher/Qwen3-8B-ShiningValiant3-i1-GGUF • 8B • Updated • 478 • 2
ValiantLabs/Qwen3-1.7B-ShiningValiant3 • Text Generation • 2B • Updated • 13 • 5
Triangle104/Qwen3-8B-ShiningValiant3-Q4_K_S-GGUF • Text Generation • 8B • Updated • 2
Triangle104/Qwen3-8B-ShiningValiant3-Q4_K_M-GGUF • Text Generation • 8B • Updated • 5
Triangle104/Qwen3-8B-ShiningValiant3-Q5_K_S-GGUF • Text Generation • 8B • Updated • 8
Triangle104/Qwen3-8B-ShiningValiant3-Q5_K_M-GGUF • Text Generation • 8B • Updated • 3
Triangle104/Qwen3-8B-ShiningValiant3-Q6_K-GGUF • Text Generation • 8B • Updated
Triangle104/Qwen3-8B-ShiningValiant3-Q8_0-GGUF • Text Generation • 8B • Updated • 10
Triangle104/Qwen3-1.7B-ShiningValiant3-Q4_K_S-GGUF • Text Generation • 2B • Updated • 2
Triangle104/Qwen3-1.7B-ShiningValiant3-Q4_K_M-GGUF • Text Generation • 2B • Updated • 1
Triangle104/Qwen3-1.7B-ShiningValiant3-Q5_K_S-GGUF • Text Generation • 2B • Updated • 3
Triangle104/Qwen3-1.7B-ShiningValiant3-Q5_K_M-GGUF • Text Generation • 2B • Updated • 8
Triangle104/Qwen3-1.7B-ShiningValiant3-Q6_K-GGUF • Text Generation • 2B • Updated • 5
Triangle104/Qwen3-1.7B-ShiningValiant3-Q8_0-GGUF • Text Generation • 2B • Updated • 12
mradermacher/Qwen3-1.7B-ShiningValiant3-GGUF • 2B • Updated • 122
mradermacher/Qwen3-1.7B-ShiningValiant3-i1-GGUF • 2B • Updated • 102