Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Yukang
/
Qwen2.5-32B-Open-R1-GRPO
like
0
Text Generation
Transformers
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
Qwen2.5-32B-Open-R1-GRPO
/
training_args.bin
Commit History
Training in progress, step 5
8004b9e
verified
Yukang
commited on
Sep 10, 2025
Training in progress, step 50
87e4b50
verified
Yukang
commited on
Sep 9, 2025
Training in progress, step 25
26fea1c
verified
Yukang
commited on
Sep 9, 2025
Training in progress, step 45
270d498
verified
Yukang
commited on
Sep 9, 2025
Training in progress, step 20
f7a1e99
verified
Yukang
commited on
Sep 9, 2025
Training in progress, step 40
2f43cc1
verified
Yukang
commited on
Sep 9, 2025
Training in progress, step 25
6e05189
verified
Yukang
commited on
Sep 8, 2025
Training in progress, step 5
3af52f1
verified
Yukang
commited on
Sep 8, 2025
Training in progress, step 5
94eff78
verified
Yukang
commited on
Sep 8, 2025
Training in progress, step 200
db3beec
verified
Yukang
commited on
Sep 7, 2025
Training in progress, step 100
11a0725
verified
Yukang
commited on
Sep 7, 2025
Training in progress, step 20
99a762d
verified
Yukang
commited on
Sep 7, 2025