Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nm-testing
's Collections
KV Cache Quantization
Models in CI
FP8-Block Quantized Models
LLM Compressor testing
Speculators testing
Sparse-Llama-3.1-8B-2of4
SparseGPT LLMs
FP8 Models
LLM Compressor testing
updated
26 days ago
Upvote
-
nm-testing/tinysmokellama-3.2
354k
•
Updated
Sep 17
•
27.7k
nm-testing/llama2.c-stories42M-pruned2.4
Updated
Oct 29
•
430
nm-testing/tinyllama-fp8-dynamic-compressed
1B
•
Updated
Oct 9, 2024
•
404
nm-testing/tinyllama-w4a16-compressed
0.3B
•
Updated
Oct 9, 2024
•
690
nm-testing/tinyllama-w8a8-compressed
1B
•
Updated
Oct 9, 2024
•
1.03k
nm-testing/tinyllama-w8a16-dense
1B
•
Updated
Oct 9, 2024
•
155
nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8-Dynamic-compressed
1B
•
Updated
Jan 14
•
601
nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8-Dynamic-uncompressed
1B
•
Updated
Jan 14
•
177
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-compressed
0.3B
•
Updated
Jan 14
•
40
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-uncompressed
1B
•
Updated
Jan 14
•
17
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Per-Token-compressed
1B
•
Updated
Jan 14
•
41
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Per-Token-uncompressed
1B
•
Updated
Jan 14
•
19
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-G128-compressed
0.4B
•
Updated
Jan 14
•
609
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-G128-uncompressed
1B
•
Updated
Jan 14
•
177
Upvote
-
Share collection
View history
Collection guide
Browse collections