LGOCMay 19, 2024

Retraction-Free Decentralized Non-convex Optimization with Orthogonal Constraints

arXiv:2405.11590v29 citationsh-index: 20
Originality Incremental advance
AI Analysis

This addresses the problem of computational inefficiency in decentralized optimization for researchers and practitioners in machine learning and signal processing, offering an incremental improvement by adapting a centralized method to a decentralized setting.

The paper tackles decentralized non-convex optimization with orthogonal constraints by proposing DRFGT, a retraction-free algorithm that avoids costly projections like SVD, achieving an ergodic convergence rate of O(1/K) and linear convergence under certain conditions, with numerical experiments showing performance on par with state-of-the-art methods while reducing computational overhead.

In this paper, we investigate decentralized non-convex optimization with orthogonal constraints. Conventional algorithms for this setting require either manifold retractions or other types of projection to ensure feasibility, both of which involve costly linear algebra operations (e.g., SVD or matrix inversion). On the other hand, infeasible methods are able to provide similar performance with higher computational efficiency. Inspired by this, we propose the first decentralized version of the retraction-free landing algorithm, called \textbf{D}ecentralized \textbf{R}etraction-\textbf{F}ree \textbf{G}radient \textbf{T}racking (DRFGT). We theoretically prove that DRFGT enjoys the ergodic convergence rate of $\mathcal{O}(1/K)$, matching the convergence rate of centralized, retraction-based methods. We further establish that under a local Riemannian PŁ condition, DRFGT achieves a much faster linear convergence rate. Numerical experiments demonstrate that DRFGT performs on par with the state-of-the-art retraction-based methods with substantially reduced computational overhead.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes