Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
akhauriyash
/
DDR1_Q1.5B-GRPOFixReward
like
0
Safetensors
qwen2
Model card
Files
Files and versions
xet
Community
main
DDR1_Q1.5B-GRPOFixReward
/
training_args.bin
Commit History
Training in progress, step 20
07bcaf2
verified
akhauriyash
commited on
Jan 20
Training in progress, step 20
7b6294f
verified
akhauriyash
commited on
Nov 21, 2025