AI GTMar 13, 2021

Online Double Oracle

Le Cong Dinh, Yaodong Yang, Stephen McAleer, Zheng Tian, Nicolas Perez Nieves, Oliver Slumbers, David Henry Mguni, Haitham Bou Ammar, Jun Wang

arXiv:2103.07780v520.235 citations

Originality Highly original

AI Analysis

This addresses a critical challenge in economics, operations research, and AI for scenarios with huge strategy spaces, offering a more efficient solution for strategic decision-making.

The paper tackles the problem of solving two-player zero-sum normal-form games with prohibitively large action spaces by proposing the Online Double Oracle (ODO) algorithm, which combines no-regret analysis with Double Oracle methods and provably converges to a Nash equilibrium with a regret bound of O(√(T k log(k))), outperforming existing methods like DO, PSRO, and Multiplicative Weight Update in convergence rate and average payoff on real-world games.

Solving strategic games with huge action space is a critical yet under-explored topic in economics, operations research and artificial intelligence. This paper proposes new learning algorithms for solving two-player zero-sum normal-form games where the number of pure strategies is prohibitively large. Specifically, we combine no-regret analysis from online learning with Double Oracle (DO) methods from game theory. Our method -- \emph{Online Double Oracle (ODO)} -- is provably convergent to a Nash equilibrium (NE). Most importantly, unlike normal DO methods, ODO is \emph{rationale} in the sense that each agent in ODO can exploit strategic adversary with a regret bound of $\mathcal{O}(\sqrt{T k \log(k)})$ where $k$ is not the total number of pure strategies, but rather the size of \emph{effective strategy set} that is linearly dependent on the support size of the NE. On tens of different real-world games, ODO outperforms DO, PSRO methods, and no-regret algorithms such as Multiplicative Weight Update by a significant margin, both in terms of convergence rate to a NE and average payoff against strategic adversaries.

View on arXiv PDF

Similar