MA AI CY ET LGFeb 19, 2025

Multi-Agent Risks from Advanced AI

Lewis Hammond, Alan Chan, Jesse Clifton, Jason Hoelscher-Obermaier, Akbir Khan, Euan McLean, Chandler Smith, Wolfram Barfuss, Jakob Foerster, Tomáš Gavenčiak, The Anh Han, Edward Hughes

Stanford

arXiv:2502.14143v129.3155 citationsh-index: 55

Originality Synthesis-oriented

AI Analysis

This addresses safety, governance, and ethical challenges for stakeholders in AI development and deployment, but it is incremental as it builds on existing risk analysis frameworks.

The paper tackles the problem of novel risks from complex multi-agent AI systems by providing a structured taxonomy of failure modes and risk factors, illustrating these with real-world examples and experimental evidence.

The rapid development of advanced AI agents and the imminent deployment of many instances of these agents will give rise to multi-agent systems of unprecedented complexity. These systems pose novel and under-explored risks. In this report, we provide a structured taxonomy of these risks by identifying three key failure modes (miscoordination, conflict, and collusion) based on agents' incentives, as well as seven key risk factors (information asymmetries, network effects, selection pressures, destabilising dynamics, commitment problems, emergent agency, and multi-agent security) that can underpin them. We highlight several important instances of each risk, as well as promising directions to help mitigate them. By anchoring our analysis in a range of real-world examples and experimental evidence, we illustrate the distinct challenges posed by multi-agent systems and their implications for the safety, governance, and ethics of advanced AI.

View on arXiv PDF

Similar