Question about benchmark evaluation

#55
by TMRBMWK - opened

Which evaluation set does SWE-Agentless refer to here?
How were the base models evaluated (what were the inputs and outputs, and how was correctness verified)?

Below we present the performance of the base models.

[Screenshot: Clipboard_Screenshot_1772184985]
