GT LG DSFeb 24, 2022

No-Regret Learning in Games is Turing Complete

Gabriel P. Andrade, Rafael Frongillo, Georgios Piliouras

arXiv:2202.11871v11.2

Originality Highly original

AI Analysis

This is a foundational negative result for multi-agent machine learning, showing that determining convergence to equilibria is computationally undecidable.

The paper proves that learning in games, specifically the replicator dynamic on matrix games, is Turing complete, implying undecidability of reachability problems such as equilibrium convergence.

Games are natural models for multi-agent machine learning settings, such as generative adversarial networks (GANs). The desirable outcomes from algorithmic interactions in these games are encoded as game theoretic equilibrium concepts, e.g. Nash and coarse correlated equilibria. As directly computing an equilibrium is typically impractical, one often aims to design learning algorithms that iteratively converge to equilibria. A growing body of negative results casts doubt on this goal, from non-convergence to chaotic and even arbitrary behaviour. In this paper we add a strong negative result to this list: learning in games is Turing complete. Specifically, we prove Turing completeness of the replicator dynamic on matrix games, one of the simplest possible settings. Our results imply the undecicability of reachability problems for learning algorithms in games, a special case of which is determining equilibrium convergence.

View on arXiv PDF

Similar