movefast/Qwen2.5-7B-Instruct-GRPO
Text Generation
Transformers
Safetensors
DigitalLearningGmbH/MATH-lighteval
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main: Qwen2.5-7B-Instruct-GRPO/training_args.bin
Commit History
Training in progress, step 282 | 27be9a7 | verified | movefast committed on Mar 28, 2025
Training in progress, step 312 | caf11bd | verified | movefast committed on Mar 28, 2025
Training in progress, step 235 | deca323 | verified | movefast committed on Mar 28, 2025
Training in progress, step 234 | 53eef7e | verified | movefast committed on Mar 28, 2025
Training in progress, step 188 | 49b2ad8 | verified | movefast committed on Mar 28, 2025
Training in progress, step 195 | 8dd3998 | verified | movefast committed on Mar 28, 2025
Training in progress, step 141 | e5da1c3 | verified | movefast committed on Mar 28, 2025
Training in progress, step 156 | 72fc392 | verified | movefast committed on Mar 28, 2025
Training in progress, step 94 | 985e7b3 | verified | movefast committed on Mar 28, 2025
Training in progress, step 78 | ac9f32c | verified | movefast committed on Mar 28, 2025
Training in progress, step 47 | 07e6291 | verified | movefast committed on Mar 28, 2025
Training in progress, step 39 | 2059cfe | verified | movefast committed on Mar 28, 2025