请问下这个 SWE-Agentless 指的是哪个评测集?base模型是如何评测的(输入,输出, 怎么verify是否正确?)
Below we present the performance of the base models.
· Sign up or log in to comment