LGOCApr 12, 2024

Federated Optimization with Doubly Regularized Drift Correction

arXiv:2404.08447v118 citationsh-index: 6ICML
Originality Incremental advance
AI Analysis

This work addresses communication and computational inefficiencies in federated learning, which is crucial for training models on decentralized devices, but it appears incremental as it builds on established methods like DANE.

The paper tackles the problem of client drift in federated learning, which hampers performance and increases communication costs, by proposing FedRed, a method that achieves improved local computational complexity while maintaining the same communication complexity as existing methods.

Federated learning is a distributed optimization paradigm that allows training machine learning models across decentralized devices while keeping the data localized. The standard method, FedAvg, suffers from client drift which can hamper performance and increase communication costs over centralized methods. Previous works proposed various strategies to mitigate drift, yet none have shown uniformly improved communication-computation trade-offs over vanilla gradient descent. In this work, we revisit DANE, an established method in distributed optimization. We show that (i) DANE can achieve the desired communication reduction under Hessian similarity constraints. Furthermore, (ii) we present an extension, DANE+, which supports arbitrary inexact local solvers and has more freedom to choose how to aggregate the local updates. We propose (iii) a novel method, FedRed, which has improved local computational complexity and retains the same communication complexity compared to DANE/DANE+. This is achieved by using doubly regularized drift correction.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes