Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
LifelongAlignment
/
Qwen2-0.5B-Instruct_CPPO-REWARD_REWARD_2
like
0
Follow
Lifelong Alignment of Agents
6
Model card
Files
Files and versions
xet
Community
main
Qwen2-0.5B-Instruct_CPPO-REWARD_REWARD_2
1.52 kB
1 contributor
History:
1 commit
avecplezir
initial commit
9445799
verified
8 months ago
.gitattributes
1.52 kB
initial commit
8 months ago