MA AI GT LG SYDec 31, 2020

Model Free Reinforcement Learning Algorithm for Stationary Mean field Equilibrium for Multiple Types of Agents

arXiv:2012.15377v15.97 citations

Originality Incremental advance

AI Analysis

This work provides a method for finding equilibrium in complex multi-agent systems, particularly relevant for scenarios like cyber-attacks, which is an incremental step for researchers in multi-agent reinforcement learning.

This paper addresses multi-agent Markov strategic interactions with multiple agent types, modeling them as a mean-field game. It characterizes the existence of a stationary multi-type Mean Field Equilibrium (MMFE) and proposes a policy gradient-based Reinforcement Learning algorithm to compute it when dynamics are unknown, demonstrating convergence in a cyber-attack scenario.

We consider a multi-agent Markov strategic interaction over an infinite horizon where agents can be of multiple types. We model the strategic interaction as a mean-field game in the asymptotic limit when the number of agents of each type becomes infinite. Each agent has a private state; the state evolves depending on the distribution of the state of the agents of different types and the action of the agent. Each agent wants to maximize the discounted sum of rewards over the infinite horizon which depends on the state of the agent and the distribution of the state of the leaders and followers. We seek to characterize and compute a stationary multi-type Mean field equilibrium (MMFE) in the above game. We characterize the conditions under which a stationary MMFE exists. Finally, we propose Reinforcement learning (RL) based algorithm using policy gradient approach to find the stationary MMFE when the agents are unaware of the dynamics. We, numerically, evaluate how such kind of interaction can model the cyber attacks among defenders and adversaries, and show how RL based algorithm can converge to an equilibrium.

View on arXiv PDF

Similar