LG MA MLJun 11, 2020

Learning Individually Inferred Communication for Multi-Agent Cooperation

arXiv:2006.06455v221.3138 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses communication inefficiencies for multi-agent reinforcement learning systems, representing an incremental improvement over prior work.

The paper tackled the problem of inefficient broadcast communication in multi-agent cooperation by proposing Individually Inferred Communication (I2C), which reduces communication overhead and improves performance in various cooperative scenarios compared to existing methods.

Communication lays the foundation for human cooperation. It is also crucial for multi-agent cooperation. However, existing work focuses on broadcast communication, which is not only impractical but also leads to information redundancy that could even impair the learning process. To tackle these difficulties, we propose Individually Inferred Communication (I2C), a simple yet effective model to enable agents to learn a prior for agent-agent communication. The prior knowledge is learned via causal inference and realized by a feed-forward neural network that maps the agent's local observation to a belief about who to communicate with. The influence of one agent on another is inferred via the joint action-value function in multi-agent reinforcement learning and quantified to label the necessity of agent-agent communication. Furthermore, the agent policy is regularized to better exploit communicated messages. Empirically, we show that I2C can not only reduce communication overhead but also improve the performance in a variety of multi-agent cooperative scenarios, comparing to existing methods. The code is available at https://github.com/PKU-AI-Edge/I2C.

View on arXiv PDF Code

Similar