caiyuchen commited on
Commit
bd07be9
·
verified ·
1 Parent(s): 76513ff

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -18,7 +18,8 @@ base_model:
18
  # On Predictability of Reinforcement Learning Dynamics for Large Language Models
19
 
20
 
21
- ![Paper Overview]
 
22
 
23
 
24
  This repository provides one of the models used in our paper **"On Predictability of Reinforcement Learning Dynamics for Large Language Models"** for evaluating and predicting reinforcement learning (RL) dynamics in large language models (LLMs).
 
18
  # On Predictability of Reinforcement Learning Dynamics for Large Language Models
19
 
20
 
21
+ ![Overview](https://huggingface.co/caiyuchen/DAPO-step-0/tree/main/overview.png)
22
+
23
 
24
 
25
  This repository provides one of the models used in our paper **"On Predictability of Reinforcement Learning Dynamics for Large Language Models"** for evaluating and predicting reinforcement learning (RL) dynamics in large language models (LLMs).