OC GT LGMay 28, 2019

Competitive Gradient Descent

arXiv:1905.12103v329.3118 citationsh-index: 57Has Code

Originality Highly original

AI Analysis

This addresses a fundamental problem in game theory and multi-agent learning, offering a more stable and efficient algorithm for practitioners in these fields.

The authors tackled the problem of computing Nash equilibria in competitive two-player games by introducing Competitive Gradient Descent, which avoids oscillatory/divergent behaviors seen in prior methods and achieves exponential local convergence for convex-concave games. Their method shows robust convergence without stepsize adaptation and allows larger stepsizes for faster convergence in numerical experiments.

We introduce a new algorithm for the numerical computation of Nash equilibria of competitive two-player games. Our method is a natural generalization of gradient descent to the two-player setting where the update is given by the Nash equilibrium of a regularized bilinear local approximation of the underlying game. It avoids oscillatory and divergent behaviors seen in alternating gradient descent. Using numerical experiments and rigorous analysis, we provide a detailed comparison to methods based on \emph{optimism} and \emph{consensus} and show that our method avoids making any unnecessary changes to the gradient dynamics while achieving exponential (local) convergence for (locally) convex-concave zero sum games. Convergence and stability properties of our method are robust to strong interactions between the players, without adapting the stepsize, which is not the case with previous methods. In our numerical experiments on non-convex-concave problems, existing methods are prone to divergence and instability due to their sensitivity to interactions among the players, whereas we never observe divergence of our algorithm. The ability to choose larger stepsizes furthermore allows our algorithm to achieve faster convergence, as measured by the number of model evaluations.

View on arXiv PDF Code

Similar