
Zhihang Yuan

hahnyuan
http://hahnyuan.com/

AI & ML interests

None yet

Organizations

None yet

Authored 7 papers (over 1 year ago)

LLM Inference Unveiled: Survey and Roofline Model Insights

Paper • 2402.16363 • Published Feb 26, 2024

Post-training Quantization on Diffusion Models

Paper • 2211.15736 • Published Nov 28, 2022

PB-LLM: Partially Binarized Large Language Models

Paper • 2310.00034 • Published Sep 29, 2023

QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning

Paper • 2402.03666 • Published Feb 6, 2024

WKVQuant: Quantizing Weight and Key/Value Cache for Large Language Models Gains More

Paper • 2402.12065 • Published Feb 19, 2024

DiTFastAttn: Attention Compression for Diffusion Transformer Models

Paper • 2406.08552 • Published Jun 12, 2024

PD-Quant: Post-Training Quantization based on Prediction Difference Metric

Paper • 2212.07048 • Published Dec 14, 2022