Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
embedl
/
Llama-3.2-3B-Instruct-FlashHead-W4A16
like
4
Follow
Embedl
12
Safetensors
flash_head_llama
text-generation-inference
custom_code
compressed-tensors
License:
embedl-models-community-licence-1.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Llama-3.2-3B-Instruct-FlashHead-W4A16
/
modeling_flash_head_llama.py
swaze
Upload 3 files
006a2e6
verified
2 days ago
raw
Copy download link
history
blame
contribute
delete
Safe
78 Bytes
from
embedl.models.llama.modeling_flash_head
import
FlashHeadLlamaForCausalLM