OCLGNov 4, 2021

Quasi-Newton Methods for Saddle Point Problems and Beyond

arXiv:2111.02708v520 citations
Originality Incremental advance
AI Analysis

It addresses optimization challenges in machine learning and related fields by improving convergence for saddle point problems, though it appears incremental as it adapts existing quasi-Newton methods to a new context.

This paper tackles saddle point problems by proposing quasi-Newton methods with Broyden family updates, achieving explicit local superlinear convergence rates such as O((1-1/(nκ^2))^{k(k-1)/2}) and O((1-1/n)^{k(k-1)/2}) for specific algorithms.

This paper studies quasi-Newton methods for solving strongly-convex-strongly-concave saddle point problems (SPP). We propose greedy and random Broyden family updates for SPP, which have explicit local superlinear convergence rate of ${\mathcal O}\big(\big(1-\frac{1}{nκ^2}\big)^{k(k-1)/2}\big)$, where $n$ is dimensions of the problem, $κ$ is the condition number and $k$ is the number of iterations. The design and analysis of proposed algorithm are based on estimating the square of indefinite Hessian matrix, which is different from classical quasi-Newton methods in convex optimization. We also present two specific Broyden family algorithms with BFGS-type and SR1-type updates, which enjoy the faster local convergence rate of $\mathcal O\big(\big(1-\frac{1}{n}\big)^{k(k-1)/2}\big)$. Additionally, we extend our algorithms to solve general nonlinear equations and prove it enjoys the similar convergence rate.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes