LGMLNov 19, 2021

An Expectation-Maximization Perspective on Federated Learning

arXiv:2111.10192v113 citations
Originality Incremental advance
AI Analysis

This work provides a theoretical unification for federated learning methods, with incremental extensions for sparsity to address practical efficiency issues in distributed settings.

The paper tackles the problem of interpreting federated learning as a hierarchical latent variable model, showing that FedAvg corresponds to a hard-EM algorithm with Gaussian priors, and proposes FedSparse to learn sparse neural networks, reducing communication and computational costs.

Federated learning describes the distributed training of models across multiple clients while keeping the data private on-device. In this work, we view the server-orchestrated federated learning process as a hierarchical latent variable model where the server provides the parameters of a prior distribution over the client-specific model parameters. We show that with simple Gaussian priors and a hard version of the well known Expectation-Maximization (EM) algorithm, learning in such a model corresponds to FedAvg, the most popular algorithm for the federated learning setting. This perspective on FedAvg unifies several recent works in the field and opens up the possibility for extensions through different choices for the hierarchical model. Based on this view, we further propose a variant of the hierarchical model that employs prior distributions to promote sparsity. By similarly using the hard-EM algorithm for learning, we obtain FedSparse, a procedure that can learn sparse neural networks in the federated learning setting. FedSparse reduces communication costs from client to server and vice-versa, as well as the computational costs for inference with the sparsified network - both of which are of great practical importance in federated learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes