Qwen/Qwen3-4B-FP8
Tags: Text Generation, Transformers, Safetensors, qwen3, conversational, text-generation-inference, fp8
arXiv: 2309.00071, 2505.09388
License: apache-2.0
Commit History
Remove vLLM FP8 Limitation
5bb0cb1 (verified), simon-mo committed on Apr 29, 2025

Update README.md
bcd75a3 (verified), yangapku committed on Apr 28, 2025

Update README.md
35fec96 (verified), littlebird13 committed on Apr 28, 2025

Update README.md
1ef33a9 (verified), jklj077 committed on Apr 28, 2025

Delete special_tokens_map.json
97f8501 (verified), littlebird13 committed on Apr 28, 2025

Delete added_tokens.json
be2fe05 (verified), littlebird13 committed on Apr 28, 2025

Update README.md
c1919f6 (verified), littlebird13 committed on Apr 28, 2025

Update generation_config.json
e66e5a4 (verified), littlebird13 committed on Apr 28, 2025

Update README.md
1d3f2ab (verified), littlebird13 committed on Apr 28, 2025

Upload folder using huggingface_hub
ae9c71f (verified), littlebird13 committed on Apr 28, 2025

initial commit
3fdd654 (verified), littlebird13 committed on Apr 28, 2025