hZzy
/

qwen2.5-0.5b-expo-DPO-ES-0.1

alignment-handbook

Generated from Trainer

Model card Files Files and versions

qwen2.5-0.5b-expo-DPO-ES-0.1

1.99 GB

1 contributor

History: 41 commits

This model has 1 file scanned as suspicious.

hZzy's picture

End of training

ee26d68 verified 12 months ago