Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
10
Ziqi wang
wzq016
Follow
zeyang1999's profile picture
1 follower
·
1 following
https://wzq016.github.io
wzq016
wzq016
AI & ML interests
NLP
Organizations
wzq016
's models
41
Sort: Recently updated
wzq016/qwen25-entrie-guideline-8k
8B
•
Updated
Apr 6, 2025
wzq016/qwen25-rlrm-filtered-guideline
8B
•
Updated
Apr 4, 2025
wzq016/qwen25-rlrm-entire-guideline
8B
•
Updated
Apr 4, 2025
wzq016/llama3-skywork-rlrm-code-math-grpo-kl
8B
•
Updated
Mar 27, 2025
wzq016/qwen25-skywork-rlrm-code-math-grpo-kl
8B
•
Updated
Mar 26, 2025
•
1
wzq016/llama3-skywork-rlrm-new-filtered-grpo-kl
8B
•
Updated
Mar 22, 2025
•
1
wzq016/llama3-skywork-rlrm-new-filtered-code-grpo-kl
8B
•
Updated
Mar 22, 2025
wzq016/llama3-skywork-rlrm-filtered-code-grpo-kl
8B
•
Updated
Mar 21, 2025
wzq016/llama3-skywork-rlrm-filtered-grpo-kl
8B
•
Updated
Mar 21, 2025
•
1
wzq016/llama3-skywork-sft-rlrm
8B
•
Updated
Mar 21, 2025
•
1
wzq016/llama3-skywork-rlrm
8B
•
Updated
Mar 21, 2025
Previous
1
2
Next