LGDCMLMar 3, 2020

Evaluation Framework For Large-scale Federated Learning

arXiv:2003.01575v20.1012 citationsHas Code
AI Analysis15

This work addresses the problem of unstable model performance in federated learning due to non-IID data for researchers and practitioners, but it is incremental as it focuses on evaluation rather than new learning methods.

The paper tackles the challenge of evaluating federated learning systems under realistic non-IID data conditions by introducing a framework that includes open-source non-IID datasets and rigorous evaluation metrics, resulting in a benchmark for large-scale federated learning research.

Federated learning is proposed as a machine learning setting to enable distributed edge devices, such as mobile phones, to collaboratively learn a shared prediction model while keeping all the training data on device, which can not only take full advantage of data distributed across millions of nodes to train a good model but also protect data privacy. However, learning in scenario above poses new challenges. In fact, data across a massive number of unreliable devices is likely to be non-IID (identically and independently distributed), which may make the performance of models trained by federated learning unstable. In this paper, we introduce a framework designed for large-scale federated learning which consists of approaches to generating dataset and modular evaluation framework. Firstly, we construct a suite of open-source non-IID datasets by providing three respects including covariate shift, prior probability shift, and concept shift, which are grounded in real-world assumptions. In addition, we design several rigorous evaluation metrics including the number of network nodes, the size of datasets, the number of communication rounds and communication resources etc. Finally, we present an open-source benchmark for large-scale federated learning research.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes