LGDCOCAug 10, 2021

Decentralized Composite Optimization with Compression

arXiv:2108.04448v210 citations
AI Analysis

This addresses communication bottlenecks in distributed machine learning for scenarios involving non-smooth optimization, representing an incremental improvement over existing methods focused only on smooth components.

The paper tackles decentralized stochastic composite optimization with non-smooth components by proposing Prox-LEAD, an algorithm that reduces communication cost almost for free while working with arbitrary compression precision, as demonstrated through convergence complexities and numerical experiments.

Decentralized optimization and communication compression have exhibited their great potential in accelerating distributed machine learning by mitigating the communication bottleneck in practice. While existing decentralized algorithms with communication compression mostly focus on the problems with only smooth components, we study the decentralized stochastic composite optimization problem with a potentially non-smooth component. A \underline{Prox}imal gradient \underline{L}in\underline{EA}r convergent \underline{D}ecentralized algorithm with compression, Prox-LEAD, is proposed with rigorous theoretical analyses in the general stochastic setting and the finite-sum setting. Our theorems indicate that Prox-LEAD works with arbitrary compression precision, and it tremendously reduces the communication cost almost for free. The superiorities of the proposed algorithms are demonstrated through the comparison with state-of-the-art algorithms in terms of convergence complexities and numerical experiments. Our algorithmic framework also generally enlightens the compressed communication on other primal-dual algorithms by reducing the impact of inexact iterations, which might be of independent interest.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes