GT AI LGJan 22, 2023

Abstracting Imperfect Information Away from Two-Player Zero-Sum Games

Samuel Sokota, Ryan D'Orazio, Chun Kai Ling, David J. Wu, J. Zico Kolter, Noam Brown

arXiv:2301.09159v37.38 citationsh-index: 71

Originality Highly original

AI Analysis

This provides a new perspective for solving two-player zero-sum games, potentially improving computational efficiency and simplifying algorithms for researchers and practitioners in game theory and AI.

The paper tackles the problem of imperfect information in two-player zero-sum games by showing that certain regularized equilibria can be abstracted as perfect-information problems, enabling simplified decision-time planning algorithms without the drawbacks of existing methods.

In their seminal work, Nayyar et al. (2013) showed that imperfect information can be abstracted away from common-payoff games by having players publicly announce their policies as they play. This insight underpins sound solvers and decision-time planning algorithms for common-payoff games. Unfortunately, a naive application of the same insight to two-player zero-sum games fails because Nash equilibria of the game with public policy announcements may not correspond to Nash equilibria of the original game. As a consequence, existing sound decision-time planning algorithms require complicated additional mechanisms that have unappealing properties. The main contribution of this work is showing that certain regularized equilibria do not possess the aforementioned non-correspondence problem -- thus, computing them can be treated as perfect-information problems. Because these regularized equilibria can be made arbitrarily close to Nash equilibria, our result opens the door to a new perspective to solving two-player zero-sum games and yields a simplified framework for decision-time planning in two-player zero-sum games, void of the unappealing properties that plague existing decision-time planning approaches.

View on arXiv PDF

Similar