MAAICLApr 9

More Capable, Less Cooperative? When LLMs Fail At Zero-Cost Collaboration

arXiv:2604.0782119.71 citations
Predicted impact top 12% in MA · last 90 daysOriginality Incremental advance
AI Analysis

This highlights a critical coordination failure in multi-agent systems for AI developers, showing that scaling intelligence alone won't solve cooperation issues without deliberate design.

The study investigated whether LLM agents cooperate when helping others has no personal cost, finding that more capable models like OpenAI o3 achieved only 17% of optimal collective performance, while less capable ones like o3-mini reached 50%, despite identical instructions.

Large language model (LLM) agents increasingly coordinate in multi-agent systems, yet we lack an understanding of where and why cooperation failures may arise. In many real-world coordination problems, from knowledge sharing in organizations to code documentation, helping others carries negligible personal cost while generating substantial collective benefits. However, whether LLM agents cooperate when helping neither benefits nor harms the helper, while being given explicit instructions to do so, remains unknown. We build a multi-agent setup designed to study cooperative behavior in a frictionless environment, removing all strategic complexity from cooperation. We find that capability does not predict cooperation: OpenAI o3 achieves only 17% of optimal collective performance while OpenAI o3-mini reaches 50%, despite identical instructions to maximize group revenue. Through a causal decomposition that automates one side of agent communication, we separate cooperation failures from competence failures, tracing their origins through agent reasoning analysis. Testing targeted interventions, we find that explicit protocols double performance for low-competence models, and tiny sharing incentives improve models with weak cooperation. Our findings suggest that scaling intelligence alone will not solve coordination problems in multi-agent systems and will require deliberate cooperative design, even when helping others costs nothing.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes