linius commited on
Commit
24f3e3d
·
verified ·
1 Parent(s): 33f0bcf

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -1
README.md CHANGED
@@ -24,6 +24,8 @@ language:
24
 
25
  This model was introduced in the paper[*Surgical Post-Training: Cutting Errors, Keeping Knowledge* (Lin & Han, 2026)](https://arxiv.org/abs/2603.01683).
26
 
 
 
27
  ## Training Details & Performance
28
 
29
  - **Efficiency:** The model was trained using merely **4k rectified math data pairs**. It avoids standard multi-phase pipelines (SFT → GRPO → DPO).
@@ -72,4 +74,19 @@ generated_ids =[
72
  ]
73
 
74
  response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
75
- print(response)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
24
 
25
  This model was introduced in the paper[*Surgical Post-Training: Cutting Errors, Keeping Knowledge* (Lin & Han, 2026)](https://arxiv.org/abs/2603.01683).
26
 
27
+ - **Code Repository:** [Visual-AI/SPoT](https://github.com/Visual-AI/SPoT)
28
+
29
  ## Training Details & Performance
30
 
31
  - **Efficiency:** The model was trained using merely **4k rectified math data pairs**. It avoids standard multi-phase pipelines (SFT → GRPO → DPO).
 
74
  ]
75
 
76
  response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
77
+ print(response)
78
+ ```
79
+
80
+ ## Citation
81
+
82
+ If you find our model, data, or the SPoT methodology useful in your research, please consider citing our paper:
83
+
84
+ **BibTeX:**
85
+ ```bibtex
86
+ @article{lin2026surgical,
87
+ title={Surgical Post-Training: Cutting Errors, Keeping Knowledge},
88
+ author={Wenye Lin and Kai Han},
89
+ year={2026},
90
+ journal={arXiv preprint arXiv:2603.01683}
91
+ }
92
+ ```