LGAIOCAug 30, 2024

Painless Federated Learning: An Interplay of Line-Search and Extrapolation

arXiv:2408.17145v2h-index: 1
AI Analysis

This work addresses the challenge of efficient and robust optimization in federated learning, which is crucial for privacy-preserving machine learning in distributed systems, but it is incremental as it builds on existing line-search and extrapolation techniques.

The authors tackled the problem of convergence slowdown in federated learning due to client heterogeneity and gradient noise by introducing FedSLS and FedExpSLS algorithms, which achieve deterministic linear convergence for strongly convex objectives and perform competitively or better than existing methods across various problems.

The classical line search for learning rate (LR) tuning in the stochastic gradient descent (SGD) algorithm can tame the convergence slowdown due to data-sampling noise. In a federated setting, wherein the client heterogeneity introduces a slowdown to the global convergence, line search can be relevantly adapted. In this work, we show that a stochastic variant of line search tames the heterogeneity in federated optimization in addition to that due to client-local gradient noise. To this end, we introduce Federated Stochastic Line Search (FedSLS) algorithm and show that it achieves deterministic rates in expectation. Specifically, FedSLS offers linear convergence for strongly convex objectives even with partial client participation. Recently, the extrapolation of the server's LR has shown promise for improved empirical performance for federated learning. To benefit from extrapolation, we extend FedSLS to Federated Extrapolated Stochastic Line Search (FedExpSLS) and prove its convergence. Our extensive empirical results show that the proposed methods perform at par or better than the popular federated learning algorithms across many convex and non-convex problems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes