MAAILGSYMLSep 21, 2017

Local Communication Protocols for Learning Complex Swarm Behaviors with Deep Reinforcement Learning

arXiv:1709.07224v24 citations
Originality Incremental advance
AI Analysis

This addresses the problem of enabling efficient swarm behavior learning for multi-robot systems, though it is incremental as it builds on existing deep reinforcement learning methods with new protocols.

The paper tackles the challenge of learning decentralized control policies for swarm systems with limited local sensing and communication by proposing simple communication protocols based on histograms that encode local neighborhood relations and task-specific information, and it uses an adaptation of Trust Region Policy Optimization to achieve complex collaborative tasks like formation building and building a communication link in a simulated 2D-physics environment.

Swarm systems constitute a challenging problem for reinforcement learning (RL) as the algorithm needs to learn decentralized control policies that can cope with limited local sensing and communication abilities of the agents. While it is often difficult to directly define the behavior of the agents, simple communication protocols can be defined more easily using prior knowledge about the given task. In this paper, we propose a number of simple communication protocols that can be exploited by deep reinforcement learning to find decentralized control policies in a multi-robot swarm environment. The protocols are based on histograms that encode the local neighborhood relations of the agents and can also transmit task-specific information, such as the shortest distance and direction to a desired target. In our framework, we use an adaptation of Trust Region Policy Optimization to learn complex collaborative tasks, such as formation building and building a communication link. We evaluate our findings in a simulated 2D-physics environment, and compare the implications of different communication protocols.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes