LGFeb 12, 2015

Adding vs. Averaging in Distributed Primal-Dual Optimization

arXiv:1502.03508v2178 citations
AI Analysis

This work addresses the communication efficiency problem in distributed machine learning, offering incremental improvements over existing methods.

The paper tackles the communication bottleneck in distributed optimization for large-scale machine learning by introducing CoCoA+, a framework that allows additive combination of local updates instead of averaging, resulting in markedly improved performance on real-world datasets, especially with more machines.

Distributed optimization methods for large-scale machine learning suffer from a communication bottleneck. It is difficult to reduce this bottleneck while still efficiently and accurately aggregating partial work from different machines. In this paper, we present a novel generalization of the recent communication-efficient primal-dual framework (CoCoA) for distributed optimization. Our framework, CoCoA+, allows for additive combination of local updates to the global parameters at each iteration, whereas previous schemes with convergence guarantees only allow conservative averaging. We give stronger (primal-dual) convergence rate guarantees for both CoCoA as well as our new variants, and generalize the theory for both methods to cover non-smooth convex loss functions. We provide an extensive experimental comparison that shows the markedly improved performance of CoCoA+ on several real-world distributed datasets, especially when scaling up the number of machines.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes