OCLGJun 23, 2022

Low-Rank Mirror-Prox for Nonsmooth and Low-Rank Matrix Optimization Problems

arXiv:2206.11523v21 citationsh-index: 31
Originality Incremental advance
AI Analysis

This addresses a bottleneck in machine learning for nonsmooth matrix problems, offering an incremental improvement over existing smooth methods.

The paper tackles nonsmooth low-rank matrix optimization problems by proving that under strict complementarity, low-rank variants of mirror-prox methods converge with a rate of O(1/t) using only two low-rank SVDs per iteration, supported by empirical experiments on matrix recovery tasks.

Low-rank and nonsmooth matrix optimization problems capture many fundamental tasks in statistics and machine learning. While significant progress has been made in recent years in developing efficient methods for \textit{smooth} low-rank optimization problems that avoid maintaining high-rank matrices and computing expensive high-rank SVDs, advances for nonsmooth problems have been slow paced. In this paper we consider standard convex relaxations for such problems. Mainly, we prove that under a \textit{strict complementarity} condition and under the relatively mild assumption that the nonsmooth objective can be written as a maximum of smooth functions, approximated variants of two popular \textit{mirror-prox} methods: the Euclidean \textit{extragradient method} and mirror-prox with \textit{matrix exponentiated gradient updates}, when initialized with a "warm-start", converge to an optimal solution with rate $O(1/t)$, while requiring only two \textit{low-rank} SVDs per iteration. Moreover, for the extragradient method we also consider relaxed versions of strict complementarity which yield a trade-off between the rank of the SVDs required and the radius of the ball in which we need to initialize the method. We support our theoretical results with empirical experiments on several nonsmooth low-rank matrix recovery tasks, demonstrating both the plausibility of the strict complementarity assumption, and the efficient convergence of our proposed low-rank mirror-prox variants.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes