LGMLNov 2, 2019

Order Optimal One-Shot Distributed Learning

arXiv:1911.00731v110 citations
Originality Highly original
AI Analysis

This work addresses a bottleneck in distributed learning for scenarios with many machines and limited data per machine, enabling new machine learning paradigms where the number of machines is much larger than the number of samples per machine.

The paper tackles the problem of distributed statistical optimization in a one-shot setting with many machines and few samples per machine, proposing the Multi-Resolution Estimator (MRE) algorithm that achieves an order-optimal error bound of $ ilde{O}ig(m^{-{1}/{\max(d,2)}} n^{-1/2}ig)$, which tends to zero as the number of machines increases even with constant samples per machine.

We consider distributed statistical optimization in one-shot setting, where there are $m$ machines each observing $n$ i.i.d. samples. Based on its observed samples, each machine then sends an $O(\log(mn))$-length message to a server, at which a parameter minimizing an expected loss is to be estimated. We propose an algorithm called Multi-Resolution Estimator (MRE) whose expected error is no larger than $\tilde{O}\big(m^{-{1}/{\max(d,2)}} n^{-1/2}\big)$, where $d$ is the dimension of the parameter space. This error bound meets existing lower bounds up to poly-logarithmic factors, and is thereby order optimal. The expected error of MRE, unlike existing algorithms, tends to zero as the number of machines ($m$) goes to infinity, even when the number of samples per machine ($n$) remains upper bounded by a constant. This property of the MRE algorithm makes it applicable in new machine learning paradigms where $m$ is much larger than $n$.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes