OC LG MLNov 24, 2022

Zeroth-Order Alternating Gradient Descent Ascent Algorithms for a Class of Nonconvex-Nonconcave Minimax Problems

Zi Xu, Zi-Qi Wang, Jun-Lin Wang, Yu-Hong Dai

arXiv:2211.13668v213.615 citationsh-index: 10

Originality Highly original

AI Analysis

This addresses optimization challenges in machine learning for problems where gradient information is unavailable, offering the first zeroth-order methods with theoretical guarantees for this class of minimax problems.

The paper tackles nonconvex-nonconcave minimax problems satisfying the Polyak-Łojasiewicz condition by proposing two zeroth-order algorithms, achieving iteration complexities of O(ε^{-2}) and O(ε^{-3}) for deterministic and stochastic settings, respectively.

In this paper, we consider a class of nonconvex-nonconcave minimax problems, i.e., NC-PL minimax problems, whose objective functions satisfy the Polyak-Łojasiewicz (PL) condition with respect to the inner variable. We propose a zeroth-order alternating gradient descent ascent (ZO-AGDA) algorithm and a zeroth-order variance reduced alternating gradient descent ascent (ZO-VRAGDA) algorithm for solving NC-PL minimax problem under the deterministic and the stochastic setting, respectively. The total number of function value queries to obtain an $ε$-stationary point of ZO-AGDA and ZO-VRAGDA algorithm for solving NC-PL minimax problem is upper bounded by $\mathcal{O}(\varepsilon^{-2})$ and $\mathcal{O}(\varepsilon^{-3})$, respectively. To the best of our knowledge, they are the first two zeroth-order algorithms with the iteration complexity gurantee for solving NC-PL minimax problems.

View on arXiv PDF

Similar