caiyuchen commited on
Commit
73a2606
·
verified ·
1 Parent(s): 21ae9a9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -16,7 +16,7 @@ This model is a reinforcement learning fine-tuned version of **Qwen3-8B-Base**,
16
  - **Base Model**: Qwen3-8B-Base
17
  - **Training Method**: Reinforcement Learning (DAPO)
18
  - **Dataset**: DAPO-Math-17k
19
- - **Checkpoint**: global_step_0(no RL training, i.e. Qwen3-8B-Base)
20
 
21
  ---
22
 
 
16
  - **Base Model**: Qwen3-8B-Base
17
  - **Training Method**: Reinforcement Learning (DAPO)
18
  - **Dataset**: DAPO-Math-17k
19
+ - **Checkpoint**: global_step_0 (no RL training, i.e. Qwen3-8B-Base)
20
 
21
  ---
22