LGAIGTOCMay 25, 2022

Learning in Mean Field Games: A Survey

arXiv:2205.12944v486 citationsh-index: 51
AI Analysis

This is an incremental survey that synthesizes existing literature on combining RL and MFGs to tackle scalability issues in game theory for researchers and practitioners.

The survey addresses the challenge of solving large-scale non-cooperative and cooperative games by reviewing Reinforcement Learning (RL) methods applied to Mean Field Games (MFGs), which approximate infinite-player scenarios, and presents numerical illustrations on a benchmark problem.

Non-cooperative and cooperative games with a very large number of players have many applications but remain generally intractable when the number of players increases. Introduced by Lasry and Lions, and Huang, Caines and Malhamé, Mean Field Games (MFGs) rely on a mean-field approximation to allow the number of players to grow to infinity. Traditional methods for solving these games generally rely on solving partial or stochastic differential equations with a full knowledge of the model. Recently, Reinforcement Learning (RL) has appeared promising to solve complex problems at scale. The combination of RL and MFGs is promising to solve games at a very large scale both in terms of population size and environment complexity. In this survey, we review the quickly growing recent literature on RL methods to learn equilibria and social optima in MFGs. We first identify the most common settings (static, stationary, and evolutive) of MFGs. We then present a general framework for classical iterative methods (based on best-response computation or policy evaluation) to solve MFGs in an exact way. Building on these algorithms and the connection with Markov Decision Processes, we explain how RL can be used to learn MFG solutions in a model-free way. Last, we present numerical illustrations on a benchmark problem, and conclude with some perspectives.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes