OCNEMar 29, 2021

Saddle Point Optimization with Approximate Minimization Oracle

arXiv:2103.15985v29 citations
AI Analysis

This work addresses optimization problems in machine learning, such as those in GANs, but is incremental as it builds on existing saddle point methods with a new theoretical analysis.

The paper tackles saddle point optimization by analyzing an alternative approach that uses an approximate minimization oracle instead of gradient-based methods, deriving conditions for linear convergence on locally strong convex-concave smooth functions and revealing a shortcoming in robust adversarial reinforcement learning algorithms.

A major approach to saddle point optimization $\min_x\max_y f(x, y)$ is a gradient based approach as is popularized by generative adversarial networks (GANs). In contrast, we analyze an alternative approach relying only on an oracle that solves a minimization problem approximately. Our approach locates approximate solutions $x'$ and $y'$ to $\min_{x'}f(x', y)$ and $\max_{y'}f(x, y')$ at a given point $(x, y)$ and updates $(x, y)$ toward these approximate solutions $(x', y')$ with a learning rate $η$. On locally strong convex--concave smooth functions, we derive conditions on $η$ to exhibit linear convergence to a local saddle point, which reveals a possible shortcoming of recently developed robust adversarial reinforcement learning algorithms. We develop a heuristic approach to adapt $η$ derivative-free and implement zero-order and first-order minimization algorithms. Numerical experiments are conducted to show the tightness of the theoretical results as well as the usefulness of the $η$ adaptation mechanism.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes