You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

MMLU-PRO: 總分:63.08

根據我們的評估扣除掉部分因為思維鏈導致的格式錯誤，無法判別外，正確率與原始模型相比更高。

PS:

由於我們不知道原始模型如何評估，所以我們重新檢查了，原始模型得分，發現性能比之原始模型更強，根據我們的測試開啟思考模式下，分數比之原版高上三分達到70以上

原版採用(5-shot CoT)評估，我們僅評估zero-shot

Safetensors

Model size

24B params

Tensor type

BF16

Model tree for win10/MagKr-3.2-24B-thinking

Base model

Finetuned

Finetuned

(50)

this model