GGUF quants?
#17
by fsaudm - opened
https://github.com/ggml-org/llama.cpp/issues/16331
Until support for DeepSeek V3.2 is implemented in llama.cpp, there will be no GGUFs.
There is now a working GGUF for the Thinking version (not the Speciale extra-thinking variant so far) here: https://huggingface.co/sszymczyk/DeepSeek-V3.2-nolight-GGUF
It does not use the new sparse attention, so it basically runs the same as the earlier version.
Don't know