LG DC OC MLOct 14, 2019

SCAFFOLD: Stochastic Controlled Averaging for Federated Learning

Sai Praneeth Karimireddy, Satyen Kale, Mehryar Mohri, Sashank J. Reddi, Sebastian U. Stich, Ananda Theertha Suresh

arXiv:1910.06378v454.9549 citations

Originality Highly original

AI Analysis

This addresses a key bottleneck in federated learning for distributed systems with non-iid data, offering a more efficient and stable algorithm.

The paper tackled the problem of slow and unstable convergence in Federated Averaging (FedAvg) due to client-drift in heterogeneous data, proposing SCAFFOLD, which uses control variates to correct this and requires significantly fewer communication rounds while being unaffected by data heterogeneity.

Federated Averaging (FedAvg) has emerged as the algorithm of choice for federated learning due to its simplicity and low communication cost. However, in spite of recent research efforts, its performance is not fully understood. We obtain tight convergence rates for FedAvg and prove that it suffers from `client-drift' when the data is heterogeneous (non-iid), resulting in unstable and slow convergence. As a solution, we propose a new algorithm (SCAFFOLD) which uses control variates (variance reduction) to correct for the `client-drift' in its local updates. We prove that SCAFFOLD requires significantly fewer communication rounds and is not affected by data heterogeneity or client sampling. Further, we show that (for quadratics) SCAFFOLD can take advantage of similarity in the client's data yielding even faster convergence. The latter is the first result to quantify the usefulness of local-steps in distributed optimization.

View on arXiv PDF

Similar