LGMar 8, 2023

Memory-adaptive Depth-wise Heterogeneous Federated Learning

Kai Zhang, Yutong Dai, Hongyi Wang, Eric Xing, Xun Chen, Lichao Sun

arXiv:2303.04887v310.711 citationsh-index: 35Has Code

Originality Incremental advance

AI Analysis

This addresses performance degradation in federated learning for heterogeneous devices like mobile phones and IoT, offering an incremental improvement over existing width-slimming methods.

The paper tackles the problem of memory heterogeneity in federated learning by introducing FeDepth, a method that adaptively decomposes models into blocks based on client memory budgets and trains them sequentially, resulting in improvements of 5% and over 10% in top-1 accuracy on CIFAR-10 and CIFAR-100, respectively.

Federated learning is a promising paradigm that allows multiple clients to collaboratively train a model without sharing the local data. However, the presence of heterogeneous devices in federated learning, such as mobile phones and IoT devices with varying memory capabilities, would limit the scale and hence the performance of the model could be trained. The mainstream approaches to address memory limitations focus on width-slimming techniques, where different clients train subnetworks with reduced widths locally and then the server aggregates the subnetworks. The global model produced from these methods suffers from performance degradation due to the negative impact of the actions taken to handle the varying subnetwork widths in the aggregation phase. In this paper, we introduce a memory-adaptive depth-wise learning solution in FL called FeDepth, which adaptively decomposes the full model into blocks according to the memory budgets of each client and trains blocks sequentially to obtain a full inference model. Our method outperforms state-of-the-art approaches, achieving 5% and more than 10% improvements in top-1 accuracy on CIFAR-10 and CIFAR-100, respectively. We also demonstrate the effectiveness of depth-wise fine-tuning on ViT. Our findings highlight the importance of memory-aware techniques for federated learning with heterogeneous devices and the success of depth-wise training strategy in improving the global model's performance.

View on arXiv PDF Code

Similar