LGAINov 12, 2020

Heterogeneous Data-Aware Federated Learning

arXiv:2011.06393v118 citations
AI Analysis

It addresses deployment challenges in federated learning for privacy-preserving distributed training, but appears incremental as it builds on existing FL frameworks.

The paper tackles problems in federated learning like non-i.i.d. data and disjoint classes by proposing a method that aggregates generic model parameters on the server while keeping client-specific parameters, showing significant advantages on extreme cases in benchmarks like Femnist and a proprietary traffic dataset.

Federated learning (FL) is an appealing concept to perform distributed training of Neural Networks (NN) while keeping data private. With the industrialization of the FL framework, we identify several problems hampering its successful deployment, such as presence of non i.i.d data, disjoint classes, signal multi-modality across datasets. In this work, we address these problems by proposing a novel method that not only (1) aggregates generic model parameters (e.g. a common set of task generic NN layers) on server (e.g. in traditional FL), but also (2) keeps a set of parameters (e.g, a set of task specific NN layer) specific to each client. We validate our method on the traditionally used public benchmarks (e.g., Femnist) as well as on our proprietary collected dataset (i.e., traffic classification). Results show the benefit of our method, with significant advantage on extreme cases.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes