OCLGNAFeb 21, 2014

Convergence results for projected line-search methods on varieties of low-rank matrices via Łojasiewicz inequality

arXiv:1402.5284v3144 citations
Originality Incremental advance
AI Analysis

This work addresses convergence issues in low-rank matrix optimization for researchers in numerical analysis and optimization, offering incremental improvements by extending known methods to non-smooth varieties.

This paper tackles the problem of optimizing functions on the variety of low-rank matrices by extending projected line-search methods to handle non-smoothness and unbounded curvature, deriving pointwise convergence results using a Łojasiewicz inequality. The result shows that convergence can be achieved without a priori curvature bounds when the limit point lies on the smooth part, providing asymptotic rate estimates for specific step-sizes.

The aim of this paper is to derive convergence results for projected line-search methods on the real-algebraic variety $\mathcal{M}_{\le k}$ of real $m \times n$ matrices of rank at most $k$. Such methods extend Riemannian optimization methods, which are successfully used on the smooth manifold $\mathcal{M}_k$ of rank-$k$ matrices, to its closure by taking steps along gradient-related directions in the tangent cone, and afterwards projecting back to $\mathcal{M}_{\le k}$. Considering such a method circumvents the difficulties which arise from the nonclosedness and the unbounded curvature of $\mathcal{M}_k$. The pointwise convergence is obtained for real-analytic functions on the basis of a Łojasiewicz inequality for the projection of the antigradient to the tangent cone. If the derived limit point lies on the smooth part of $\mathcal{M}_{\le k}$, i.e. in $\mathcal{M}_k$, this boils down to more or less known results, but with the benefit that asymptotic convergence rate estimates (for specific step-sizes) can be obtained without an a priori curvature bound, simply from the fact that the limit lies on a smooth manifold. At the same time, one can give a convincing justification for assuming critical points to lie in $\mathcal{M}_k$: if $X$ is a critical point of $f$ on $\mathcal{M}_{\le k}$, then either $X$ has rank $k$, or $\nabla f(X) = 0$.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes