MAAILGFeb 15, 2018

Mean Field Multi-Agent Reinforcement Learning

arXiv:1802.05438v5698 citations
Originality Incremental advance
AI Analysis

This addresses scalability issues in multi-agent systems for researchers and practitioners, offering a novel approach to handle large populations, though it builds on existing mean field concepts in a hybrid manner.

The paper tackles the intractability of multi-agent reinforcement learning with many agents by introducing Mean Field Reinforcement Learning, which approximates agent interactions using the average effect of the population, and demonstrates its effectiveness in experiments like Gaussian squeeze and battle games, achieving convergence to Nash equilibrium.

Existing multi-agent reinforcement learning methods are limited typically to a small number of agents. When the agent number increases largely, the learning becomes intractable due to the curse of the dimensionality and the exponential growth of agent interactions. In this paper, we present \emph{Mean Field Reinforcement Learning} where the interactions within the population of agents are approximated by those between a single agent and the average effect from the overall population or neighboring agents; the interplay between the two entities is mutually reinforced: the learning of the individual agent's optimal policy depends on the dynamics of the population, while the dynamics of the population change according to the collective patterns of the individual policies. We develop practical mean field Q-learning and mean field Actor-Critic algorithms and analyze the convergence of the solution to Nash equilibrium. Experiments on Gaussian squeeze, Ising model, and battle games justify the learning effectiveness of our mean field approaches. In addition, we report the first result to solve the Ising model via model-free reinforcement learning methods.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes