LGFeb 14, 2024

Momentum Approximation in Asynchronous Private Federated Learning

arXiv:2402.09247v25 citationsh-index: 2
AI Analysis

This addresses scalability and model quality issues in federated learning for large-scale systems, representing an incremental improvement.

The paper tackles the problem of degraded performance when applying momentum to asynchronous federated learning by identifying implicit bias in updates and proposing momentum approximation to minimize it, achieving a 1.15–4x speedup in convergence on benchmark datasets.

Asynchronous protocols have been shown to improve the scalability of federated learning (FL) with a massive number of clients. Meanwhile, momentum-based methods can achieve the best model quality in synchronous FL. However, naively applying momentum in asynchronous FL algorithms leads to slower convergence and degraded model performance. It is still unclear how to effective combinie these two techniques together to achieve a win-win. In this paper, we find that asynchrony introduces implicit bias to momentum updates. In order to address this problem, we propose momentum approximation that minimizes the bias by finding an optimal weighted average of all historical model updates. Momentum approximation is compatible with secure aggregation as well as differential privacy, and can be easily integrated in production FL systems with a minor communication and storage cost. We empirically demonstrate that on benchmark FL datasets, momentum approximation can achieve $1.15 \textrm{--}4\times$ speed up in convergence compared to naively combining asynchronous FL with momentum.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes