OCLGOct 25, 2022

Faster Projection-Free Augmented Lagrangian Methods via Weak Proximal Oracle

arXiv:2210.13968v24 citationsh-index: 31
Originality Highly original
AI Analysis

This work addresses high-dimensional optimization problems where exact projections are intractable, offering a faster alternative to existing projection-free methods for applications like low-rank matrix recovery.

The paper tackles convex composite optimization with affine constraints by proposing a projection-free augmented Lagrangian method using a weak proximal oracle, achieving an ergodic convergence rate of O(1/T) for objective residual and feasibility gap, which improves upon the O(1/√T) rate of existing methods, as demonstrated in experiments on low-rank and sparse covariance matrix estimation and Max Cut semidefinite relaxation.

This paper considers a convex composite optimization problem with affine constraints, which includes problems that take the form of minimizing a smooth convex objective function over the intersection of (simple) convex sets, or regularized with multiple (simple) functions. Motivated by high-dimensional applications in which exact projection/proximal computations are not tractable, we propose a \textit{projection-free} augmented Lagrangian-based method, in which primal updates are carried out using a \textit{weak proximal oracle} (WPO). In an earlier work, WPO was shown to be more powerful than the standard \textit{linear minimization oracle} (LMO) that underlies conditional gradient-based methods (aka Frank-Wolfe methods). Moreover, WPO is computationally tractable for many high-dimensional problems of interest, including those motivated by recovery of low-rank matrices and tensors, and optimization over polytopes which admit efficient LMOs. The main result of this paper shows that under a certain curvature assumption (which is weaker than strong convexity), our WPO-based algorithm achieves an ergodic rate of convergence of $O(1/T)$ for both the objective residual and feasibility gap. This result, to the best of our knowledge, improves upon the $O(1/\sqrt{T})$ rate for existing LMO-based projection-free methods for this class of problems. Empirical experiments on a low-rank and sparse covariance matrix estimation task and the Max Cut semidefinite relaxation demonstrate that of our method can outperform state-of-the-art LMO-based Lagrangian-based methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes