MAAILGSYAug 4, 2025

Communicating Plans, Not Percepts: Scalable Multi-Agent Coordination with Embodied World Models

arXiv:2508.02912v31 citationsh-index: 1
Originality Incremental advance
AI Analysis

This addresses coordination challenges in multi-agent systems, offering an incremental improvement by integrating structured models into reinforcement learning for better scalability.

The paper tackles the problem of multi-agent coordination under partial observability by comparing learned and engineered communication strategies, finding that an engineered approach using embodied world models yields superior performance, sample efficiency, and scalability in complex environments.

Robust coordination is critical for effective decision-making in multi-agent systems, especially under partial observability. A central question in Multi-Agent Reinforcement Learning (MARL) is whether to engineer communication protocols or learn them end-to-end. We investigate this dichotomy using embodied world models. We propose and compare two communication strategies for a cooperative task-allocation problem. The first, Learned Direct Communication (LDC), learns a protocol end-to-end. The second, Intention Communication, uses an engineered inductive bias: a compact, learned world model, the Imagined Trajectory Generation Module (ITGM), which uses the agent's own policy to simulate future states. A Message Generation Network (MGN) then compresses this plan into a message. We evaluate these approaches on goal-directed interaction in a grid world, a canonical abstraction for embodied AI problems, while scaling environmental complexity. Our experiments reveal that while emergent communication is viable in simple settings, the engineered, world model-based approach shows superior performance, sample efficiency, and scalability as complexity increases. These findings advocate for integrating structured, predictive models into MARL agents to enable active, goal-driven coordination.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes