Update README.md
Browse files
README.md
CHANGED
|
@@ -48,7 +48,7 @@ DeepSWE-Verifier is a fine-tuned/SFT version of [Qwen/Qwen3-14B](https://hugging
|
|
| 48 |
Discover more about DeepSWE-Preview's development and capabilities in our [technical blog post](www.google.com).
|
| 49 |
|
| 50 |
<div style="margin: 0 auto;">
|
| 51 |
-
<img src="https://cdn-
|
| 52 |
<p align="center" style="margin-top: 8px; font-style: italic; color: #666;">
|
| 53 |
Figure 1: SWE-Bench Verified Performance w.r.t. different TTS strategies. With hybrid TTS, DeepSWE-Preview achieves 59%, beating the current SOTA open-weights model (SkyWork + TTS, 47%) by 12%. We note that only using execution-based and execution-free verifiers is still effective and can bring 10+% performance.
|
| 54 |
</p>
|
|
|
|
| 48 |
Discover more about DeepSWE-Preview's development and capabilities in our [technical blog post](www.google.com).
|
| 49 |
|
| 50 |
<div style="margin: 0 auto;">
|
| 51 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/654037be97949fd2304aab7f/a7urAV3isk73ZkIbu3d7s.png" style="width: 100%;" />
|
| 52 |
<p align="center" style="margin-top: 8px; font-style: italic; color: #666;">
|
| 53 |
Figure 1: SWE-Bench Verified Performance w.r.t. different TTS strategies. With hybrid TTS, DeepSWE-Preview achieves 59%, beating the current SOTA open-weights model (SkyWork + TTS, 47%) by 12%. We note that only using execution-based and execution-free verifiers is still effective and can bring 10+% performance.
|
| 54 |
</p>
|