MAJun 5

Learning Multi-Agent Communication Protocol: Study on Information Entropy Efficiency in MARL

arXiv:2606.0720013.9

Originality Incremental advance

AI Analysis

For researchers in multi-agent reinforcement learning, this work provides a principled metric to balance performance and communication efficiency, challenging the assumption that complex architectures are necessary for performance gains.

The paper introduces the Information Entropy Efficiency Index (IEI), a metric to evaluate communication efficiency in multi-agent reinforcement learning, and shows that incorporating IEI into training yields equivalent or superior task performance with improved communication efficiency.

Multi-Agent Systems (MAS) have emerged as a fundamental paradigm for distributed problem-solving, where autonomous agents collaborate to achieve complex objectives. Within this framework, Multi-Agent Reinforcement Learning (MARL) with communication has demonstrated remarkable success in cooperative tasks. However, existing approaches predominantly pursue performance gains through increasingly complex architectures and expanding communication overhead, lacking principled metrics to evaluate the efficiency of information exchange. In this paper, we focus on enabling agents to learn efficient multi-agent communication protocols that balance performance and information compactness. We propose the Information Entropy Efficiency Index (IEI), a novel metric that quantifies the ratio between message entropy and task performance in learned communication protocols. A lower IEI indicates more compact and efficient message representations. By incorporating IEI into training loss functions, we encourage agents to develop communication protocols that achieve high performance with improved communication efficiency. Extensive experiments across diverse MARL algorithms demonstrate that our approach achieves equivalent or superior task performance compared to baseline methods while improving communication efficiency. These findings challenge the prevailing assumption that performance improvements require complex architectures or increased communication overhead and highlight the potential of improving both task success and communication efficiency to enable scalable MAS.

View on arXiv PDF

Similar