Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Qwen
/
WorldPM-72B-RLHFLow
like
10
Follow
Qwen
70.3k
Text Classification
Transformers
Safetensors
RLHFlow/pair_data_v2_80K_wsafety
English
qwen2
feature-extraction
Modeling World Preference
WorldPM
reward model
preference model
preference model pretraining
PMP
custom_code
text-embeddings-inference
arxiv:
2505.10527
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
WorldPM-72B-RLHFLow
Commit History
Update modeling_qwen2_rm.py
da08c94
verified
littlebird13
commited on
May 17, 2025
Update README.md
52a42ed
verified
refrain-wbh
commited on
May 16, 2025
Update README.md
cd66e7f
verified
littlebird13
commited on
May 16, 2025
Create LICENSE
f946fa0
verified
littlebird13
commited on
May 16, 2025
Delete configuration.json
c6b46bf
verified
littlebird13
commited on
May 16, 2025
Create README.md
5b325a4
verified
littlebird13
commited on
May 16, 2025
Delete .mv
71f18f9
verified
littlebird13
commited on
May 16, 2025
Delete .msc
c1e50ef
verified
littlebird13
commited on
May 16, 2025
Add files using upload-large-folder tool
b50cb5b
verified
littlebird13
commited on
May 16, 2025
initial commit
168ebe0
verified
clonefy
commited on
May 16, 2025