OCMLOct 24, 2018

First-order Convergence Theory for Weakly-Convex-Weakly-Concave Min-max Problems

arXiv:1810.10207v481 citations
Originality Incremental advance
AI Analysis

This addresses convergence issues in min-max optimization for machine learning applications like GANs, but it is incremental as it builds on existing proximal point methods.

The paper tackles the problem of solving non-convex non-concave min-max problems, such as those in training GANs, by proposing an algorithmic framework based on the inexact proximal point method and proving first-order convergence to a nearly stationary solution with experimental verification.

In this paper, we consider first-order convergence theory and algorithms for solving a class of non-convex non-concave min-max saddle-point problems, whose objective function is weakly convex in the variables of minimization and weakly concave in the variables of maximization. It has many important applications in machine learning including training Generative Adversarial Nets (GANs). We propose an algorithmic framework motivated by the inexact proximal point method, where the weakly monotone variational inequality (VI) corresponding to the original min-max problem is solved through approximately solving a sequence of strongly monotone VIs constructed by adding a strongly monotone mapping to the original gradient mapping. We prove first-order convergence to a nearly stationary solution of the original min-max problem of the generic algorithmic framework and establish different rates by employing different algorithms for solving each strongly monotone VI. Experiments verify the convergence theory and also demonstrate the effectiveness of the proposed methods on training GANs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes