LGSPMay 24, 2023

Stochastic Unrolled Federated Learning

arXiv:2305.15371v29 citations
Originality Incremental advance
AI Analysis

This work addresses convergence speed in federated learning, an incremental improvement for decentralized machine learning systems.

The paper tackles the challenge of applying algorithm unrolling to federated learning by introducing SURF, which uses stochastic mini-batches and a GNN-based architecture to expedite convergence, achieving theoretical near-optimal guarantees and effectiveness in collaborative image classifier training.

Algorithm unrolling has emerged as a learning-based optimization paradigm that unfolds truncated iterative algorithms in trainable neural-network optimizers. We introduce Stochastic UnRolled Federated learning (SURF), a method that expands algorithm unrolling to federated learning in order to expedite its convergence. Our proposed method tackles two challenges of this expansion, namely the need to feed whole datasets to the unrolled optimizers to find a descent direction and the decentralized nature of federated learning. We circumvent the former challenge by feeding stochastic mini-batches to each unrolled layer and imposing descent constraints to guarantee its convergence. We address the latter challenge by unfolding the distributed gradient descent (DGD) algorithm in a graph neural network (GNN)-based unrolled architecture, which preserves the decentralized nature of training in federated learning. We theoretically prove that our proposed unrolled optimizer converges to a near-optimal region infinitely often. Through extensive numerical experiments, we also demonstrate the effectiveness of the proposed framework in collaborative training of image classifiers.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes