LGGTMar 16, 2023

Decentralized Multi-Agent Reinforcement Learning for Continuous-Space Stochastic Games

arXiv:2303.13539v13 citationsh-index: 30
Originality Incremental advance
AI Analysis

This work addresses a gap in MARL for continuous-space games, which is incremental as it extends existing methods to more general settings.

The paper tackles the problem of multi-agent reinforcement learning in stochastic games with continuous state spaces and unobserved actions, proposing a decentralized algorithm with proven near-optimal policy updates and analyzing convergence probabilities for best-reply algorithms.

Stochastic games are a popular framework for studying multi-agent reinforcement learning (MARL). Recent advances in MARL have focused primarily on games with finitely many states. In this work, we study multi-agent learning in stochastic games with general state spaces and an information structure in which agents do not observe each other's actions. In this context, we propose a decentralized MARL algorithm and we prove the near-optimality of its policy updates. Furthermore, we study the global policy-updating dynamics for a general class of best-reply based algorithms and derive a closed-form characterization of convergence probabilities over the joint policy space.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes