LGAICVDCJul 19, 2022

SphereFed: Hyperspherical Federated Learning

DeepMind
arXiv:2207.09413v128 citationsh-index: 28
Originality Incremental advance
AI Analysis

This addresses data heterogeneity issues in federated learning for decentralized devices, representing an incremental improvement over existing methods.

The paper tackles the problem of non-i.i.d. data in federated learning by introducing SphereFed, a framework that constrains learned representations to a unit hypersphere, resulting in accuracy improvements of up to 6% on challenging datasets.

Federated Learning aims at training a global model from multiple decentralized devices (i.e. clients) without exchanging their private local data. A key challenge is the handling of non-i.i.d. (independent identically distributed) data across multiple clients that may induce disparities of their local features. We introduce the Hyperspherical Federated Learning (SphereFed) framework to address the non-i.i.d. issue by constraining learned representations of data points to be on a unit hypersphere shared by clients. Specifically, all clients learn their local representations by minimizing the loss with respect to a fixed classifier whose weights span the unit hypersphere. After federated training in improving the global model, this classifier is further calibrated with a closed-form solution by minimizing a mean squared loss. We show that the calibration solution can be computed efficiently and distributedly without direct access of local data. Extensive experiments indicate that our SphereFed approach is able to improve the accuracy of multiple existing federated learning algorithms by a considerable margin (up to 6% on challenging datasets) with enhanced computation and communication efficiency across datasets and model architectures.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes