Quantization
#1
by
hadadrjt - opened
Are there any plans for quantization, such as 2-bit and 4-bit with Ollama? This could reduce resource usage.
Yeah
Are there any plans for quantization, such as 2-bit and 4-bit with Ollama? This could reduce resource usage.
Yeah