MA AISep 11, 2019

On Memory Mechanism in Multi-Agent Reinforcement Learning

Yilun Zhou, Derrik E. Asher, Nicholas R. Waytowich, Julie A. Shah

arXiv:1909.05232v12.32 citations

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of designing effective algorithms for multi-agent systems, particularly in partially observable or communication-limited environments, but it is incremental as it builds on prior work without introducing a new method.

The paper investigates the utility of memory mechanisms in multi-agent reinforcement learning, showing they are beneficial when agents need to model others or communication is constrained, but cautioning that agents might achieve memoryfulness through alternative means.

Multi-agent reinforcement learning (MARL) extends (single-agent) reinforcement learning (RL) by introducing additional agents and (potentially) partial observability of the environment. Consequently, algorithms for solving MARL problems incorporate various extensions beyond traditional RL methods, such as a learned communication protocol between cooperative agents that enables exchange of private information or adaptive modeling of opponents in competitive settings. One popular algorithmic construct is a memory mechanism such that an agent's decisions can depend not only upon the current state but also upon the history of observed states and actions. In this paper, we study how a memory mechanism can be useful in environments with different properties, such as observability, internality and presence of a communication channel. Using both prior work and new experiments, we show that a memory mechanism is helpful when learning agents need to model other agents and/or when communication is constrained in some way; however we must to be cautious of agents achieving effective memoryfulness through other means.

View on arXiv PDF

Similar