LGAug 12, 2021

An Operator Splitting View of Federated Learning

arXiv:2108.05974v43 citations
AI Analysis

This work addresses the need for a unified theoretical framework in federated learning, which is incremental as it builds on existing algorithms to improve understanding and efficiency.

The paper tackles the fragmented theory and lack of formal comparison in federated learning algorithms by unifying them from an operator splitting perspective, enabling easier comparisons, refined convergence results, and new algorithmic variants, with numerical experiments validating the findings.

Over the past few years, the federated learning ($\texttt{FL}$) community has witnessed a proliferation of new $\texttt{FL}$ algorithms. However, our understating of the theory of $\texttt{FL}$ is still fragmented, and a thorough, formal comparison of these algorithms remains elusive. Motivated by this gap, we show that many of the existing $\texttt{FL}$ algorithms can be understood from an operator splitting point of view. This unification allows us to compare different algorithms with ease, to refine previous convergence results and to uncover new algorithmic variants. In particular, our analysis reveals the vital role played by the step size in $\texttt{FL}$ algorithms. The unification also leads to a streamlined and economic way to accelerate $\texttt{FL}$ algorithms, without incurring any communication overhead. We perform numerical experiments on both convex and nonconvex models to validate our findings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes