nihaomur/gemma-2-9b-it-AWQ

Text Generation

text-generation-inference

4-bit precision


nihaomur committed on Jul 18, 2024

Commit d668c9f (verified) · 1 Parent(s): 1ed6b8e

Update README.md

Files changed (1)

README.md +4 -1


@@ -1,6 +1,9 @@
 This is not the original model I made, it's google's [Gemma-2-9b-it](https://huggingface.co/google/gemma-2-9b-it) and Quantized by [AutoAWQ](https://github.com/casper-hansen/AutoAWQ).
 I quantized it with 4-bit, your GPU VRAM should be at least 8G in order to garauntee it work perfectly.
-By renning some testing on this AWQ model, this model is significantly brilliant.
+By running some test on this AWQ model, this model is significantly brilliant.
 Below is the original model card, hope you guys having fun with it.