Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
nm-testing 's Collections
KV Cache Quantization
Models in CI
FP8-Block Quantized Models
LLM Compressor testing
Speculators testing
Sparse-Llama-3.1-8B-2of4
SparseGPT LLMs
FP8 Models

LLM Compressor testing

updated 26 days ago
Upvote
-

  • nm-testing/tinysmokellama-3.2

    354k • Updated Sep 17 • 27.7k

  • nm-testing/llama2.c-stories42M-pruned2.4

    Updated Oct 29 • 430

  • nm-testing/tinyllama-fp8-dynamic-compressed

    1B • Updated Oct 9, 2024 • 404

  • nm-testing/tinyllama-w4a16-compressed

    0.3B • Updated Oct 9, 2024 • 690

  • nm-testing/tinyllama-w8a8-compressed

    1B • Updated Oct 9, 2024 • 1.03k

  • nm-testing/tinyllama-w8a16-dense

    1B • Updated Oct 9, 2024 • 155

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8-Dynamic-compressed

    1B • Updated Jan 14 • 601

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8-Dynamic-uncompressed

    1B • Updated Jan 14 • 177

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-compressed

    0.3B • Updated Jan 14 • 40

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-uncompressed

    1B • Updated Jan 14 • 17

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Per-Token-compressed

    1B • Updated Jan 14 • 41

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Per-Token-uncompressed

    1B • Updated Jan 14 • 19

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-G128-compressed

    0.4B • Updated Jan 14 • 609

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-G128-uncompressed

    1B • Updated Jan 14 • 177
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs