AIMar 18

AgentComm-Bench: Stress-Testing Cooperative Embodied AI Under Latency, Packet Loss, and Bandwidth Collapse

arXiv:2603.202855.8h-index: 1

Predicted impact top 92% in AI · last 90 daysOriginality Synthesis-oriented

AI Analysis

This addresses a critical gap for deploying cooperative AI in real-world robotics and autonomous systems, though it is incremental as it focuses on benchmarking rather than proposing a fundamentally new method.

The paper tackles the problem that cooperative multi-agent embodied AI methods are typically evaluated under idealized communication conditions, which do not reflect real-world impairments like latency and packet loss, and introduces AgentComm-Bench to stress-test these systems, revealing catastrophic performance drops such as over 96% in navigation tasks under bandwidth collapse.

Cooperative multi-agent methods for embodied AI are almost universally evaluated under idealized communication: zero latency, no packet loss, and unlimited bandwidth. Real-world deployment on robots with wireless links, autonomous vehicles on congested networks, or drone swarms in contested spectrum offers no such guarantees. We introduce AgentComm-Bench, a benchmark suite and evaluation protocol that systematically stress-tests cooperative embodied AI under six communication impairment dimensions: latency, packet loss, bandwidth collapse, asynchronous updates, stale memory, and conflicting sensor evidence. AgentComm-Bench spans three task families: cooperative perception, multi-agent waypoint navigation, and cooperative zone search, and evaluates five communication strategies, including a lightweight method we propose based on redundant message coding with staleness-aware fusion. Our experiments reveal that communication-dependent tasks degrade catastrophically: stale memory and bandwidth collapse cause over 96% performance drops in navigation, while content corruption (stale or conflicting data) reduces perception F1 by over 85%. Vulnerability depends on the interaction between impairment type and task design; perception fusion is robust to packet loss but amplifies corrupted data. Redundant message coding more than doubles navigation performance under 80% packet loss. We release AgentComm-Bench as a practical evaluation protocol and recommend that cooperative embodied AI work report performance under multiple impairment conditions.

View on arXiv PDF

Similar