DC AI LG MLApr 9, 2014

A Distributed Frank-Wolfe Algorithm for Communication-Efficient Sparse Learning

Aurélien Bellet, Yingyu Liang, Alireza Bagheri Garakani, Maria-Florina Balcan, Fei Sha

arXiv:1404.2644v348 citations

Originality Incremental advance

AI Analysis

This addresses communication efficiency in distributed sparse learning, which is incremental as it builds on existing Frank-Wolfe methods for a specific bottleneck.

The paper tackles the problem of learning sparse combinations in a distributed setting, where elements are spread over a network, by proposing a distributed Frank-Wolfe algorithm that achieves theoretical guarantees on optimization error and communication cost, with empirical validation showing it outperforms baselines and competing methods.

Learning sparse combinations is a frequent theme in machine learning. In this paper, we study its associated optimization problem in the distributed setting where the elements to be combined are not centrally located but spread over a network. We address the key challenges of balancing communication costs and optimization errors. To this end, we propose a distributed Frank-Wolfe (dFW) algorithm. We obtain theoretical guarantees on the optimization error $ε$ and communication cost that do not depend on the total number of combining elements. We further show that the communication cost of dFW is optimal by deriving a lower-bound on the communication cost required to construct an $ε$-approximate solution. We validate our theoretical analysis with empirical studies on synthetic and real-world data, which demonstrate that dFW outperforms both baselines and competing methods. We also study the performance of dFW when the conditions of our analysis are relaxed, and show that dFW is fairly robust.

View on arXiv PDF

Similar