MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games
Paper ⢠2510.15414 ⢠Published ⢠1
MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs š Accepted by ICLR 2026
Note Note: This paper has been updated to v3 on arXiv. MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs