OC LG ML Nov 24, 2022

Zeroth-Order Alternating Gradient Descent Ascent Algorithms for a Class of Nonconvex-Nonconcave Minimax Problems

arXiv:2211.13668v2 15 citations h-index: 10
Originality: Highly original
AI Analysis

This paper addresses minimax optimization problems in machine learning where gradient information is unavailable and only function values can be queried. According to the authors, it offers the first zeroth-order methods with complexity guarantees for this class of minimax problems.

The paper tackles nonconvex-nonconcave minimax problems whose objective satisfies the Polyak-Łojasiewicz (PL) condition with respect to the inner variable. It proposes two zeroth-order algorithms whose function-value query complexities are O(ε^{-2}) in the deterministic setting and O(ε^{-3}) in the stochastic setting.
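For readers unfamiliar with the PL condition, its usual textbook form for a problem $\min_x \max_y f(x, y)$ is sketched below. The symbols $f$, $x$, $y$, and $\mu$ are notation assumed here for illustration; the paper's own statement and constants may differ.

$$
\|\nabla_y f(x, y)\|^2 \;\ge\; 2\mu \Big( \max_{y'} f(x, y') - f(x, y) \Big) \quad \text{for all } x, y, \text{ with some } \mu > 0.
$$

Strong concavity in $y$ implies this inequality, but the converse does not hold, which is why the problem class is still described as nonconcave in the inner variable.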

In this paper, we consider a class of nonconvex-nonconcave minimax problems, i.e., NC-PL minimax problems, whose objective functions satisfy the Polyak-Łojasiewicz (PL) condition with respect to the inner variable. We propose a zeroth-order alternating gradient descent ascent (ZO-AGDA) algorithm and a zeroth-order variance reduced alternating gradient descent ascent (ZO-VRAGDA) algorithm for solving NC-PL minimax problems under the deterministic and the stochastic setting, respectively. The total number of function value queries needed by ZO-AGDA and ZO-VRAGDA to obtain an $\varepsilon$-stationary point of an NC-PL minimax problem is upper bounded by $\mathcal{O}(\varepsilon^{-2})$ and $\mathcal{O}(\varepsilon^{-3})$, respectively. To the best of our knowledge, they are the first two zeroth-order algorithms with an iteration complexity guarantee for solving NC-PL minimax problems.
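The idea of alternating descent and ascent steps driven only by function values can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the coordinate-wise forward-difference estimator `zo_grad`, the toy objective `f` (strongly concave in `y`, a special case of PL), the step sizes `eta_x` and `eta_y`, and the smoothing parameter `mu`. The paper's actual estimator and parameter choices may differ.

```python
import numpy as np


def zo_grad(fun, z, mu=1e-5):
    """Estimate the gradient of fun at z from function values only.

    Hypothetical helper: coordinate-wise forward differences,
    costing z.size + 1 function queries per call.
    """
    z = np.asarray(z, dtype=float)
    base = fun(z)
    grad = np.zeros_like(z)
    for i in range(z.size):
        step = np.zeros_like(z)
        step[i] = mu
        grad[i] = (fun(z + step) - base) / mu
    return grad


def f(x, y):
    """Toy objective (assumption): nonconvex in x, strongly concave in y."""
    return float(np.sum(np.sin(x) * y) - 0.5 * np.sum(y ** 2))


def zo_agda(x0, y0, eta_x=0.05, eta_y=0.5, iters=2000, mu=1e-5):
    """Alternating zeroth-order descent on x and ascent on y (sketch)."""
    x = np.array(x0, dtype=float)
    y = np.array(y0, dtype=float)
    for _ in range(iters):
        # Descent step on the outer variable x, with y held fixed.
        gx = zo_grad(lambda u: f(u, y), x, mu)
        x = x - eta_x * gx
        # Ascent step on the inner variable y, using the updated x.
        gy = zo_grad(lambda v: f(x, v), y, mu)
        y = y + eta_y * gy
    return x, y


x_out, y_out = zo_agda(x0=[1.0, -2.0], y0=[0.0, 0.0])
```

For this toy objective the inner maximizer is `y = sin(x)`, so a near-stationary output should have `y_out` close to `np.sin(x_out)` and `np.sin(x_out) * np.cos(x_out)` close to zero. The alternating order matters: the ascent step sees the `x` produced by the descent step in the same iteration rather than the previous one.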

