LGAIAug 15, 2023

NeFL: Nested Model Scaling for Federated Learning with System Heterogeneous Clients

arXiv:2308.07761v312 citationsh-index: 54Has Code
Originality Incremental advance
AI Analysis

This addresses efficiency and performance issues in federated learning for resource-constrained devices, though it is incremental as it builds on prior submodel approaches.

The paper tackles the problem of stragglers in federated learning due to system heterogeneity by proposing NeFL, a framework that divides neural networks into submodels using depthwise and widthwise scaling, achieving a 7.63% performance improvement for the worst-case submodel on CIFAR-100 compared to baselines.

Federated learning (FL) enables distributed training while preserving data privacy, but stragglers-slow or incapable clients-can significantly slow down the total training time and degrade performance. To mitigate the impact of stragglers, system heterogeneity, including heterogeneous computing and network bandwidth, has been addressed. While previous studies have addressed system heterogeneity by splitting models into submodels, they offer limited flexibility in model architecture design, without considering potential inconsistencies arising from training multiple submodel architectures. We propose nested federated learning (NeFL), a generalized framework that efficiently divides deep neural networks into submodels using both depthwise and widthwise scaling. To address the inconsistency arising from training multiple submodel architectures, NeFL decouples a subset of parameters from those being trained for each submodel. An averaging method is proposed to handle these decoupled parameters during aggregation. NeFL enables resource-constrained devices to effectively participate in the FL pipeline, facilitating larger datasets for model training. Experiments demonstrate that NeFL achieves performance gain, especially for the worst-case submodel compared to baseline approaches (7.63% improvement on CIFAR-100). Furthermore, NeFL aligns with recent advances in FL, such as leveraging pre-trained models and accounting for statistical heterogeneity. Our code is available online.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes