DC AI LGJul 14, 2025

Efficient Federated Learning with Heterogeneous Data and Adaptive Dropout

Ji Liu, Beichen Ma, Qiaolin Yu, Ruoming Jin, Jingbo Zhou, Yang Zhou, Huaiyu Dai, Haixun Wang, Dejing Dou, Patrick Valduriez

arXiv:2507.10430v27 citationsh-index: 20ACM Trans Knowl Discov Data

Originality Highly original

AI Analysis

This work addresses efficiency and accuracy challenges in Federated Learning for edge devices with heterogeneous data, representing a strong specific gain but incremental in nature.

The paper tackles the problem of data heterogeneity and stragglers in Federated Learning by proposing the FedDHAD framework, which combines dynamic model aggregation and adaptive dropout to achieve up to 6.7% higher accuracy, 2.02 times faster efficiency, and 15.0% lower computation cost compared to state-of-the-art methods.

Federated Learning (FL) is a promising distributed machine learning approach that enables collaborative training of a global model using multiple edge devices. The data distributed among the edge devices is highly heterogeneous. Thus, FL faces the challenge of data distribution and heterogeneity, where non-Independent and Identically Distributed (non-IID) data across edge devices may yield in significant accuracy drop. Furthermore, the limited computation and communication capabilities of edge devices increase the likelihood of stragglers, thus leading to slow model convergence. In this paper, we propose the FedDHAD FL framework, which comes with two novel methods: Dynamic Heterogeneous model aggregation (FedDH) and Adaptive Dropout (FedAD). FedDH dynamically adjusts the weights of each local model within the model aggregation process based on the non-IID degree of heterogeneous data to deal with the statistical data heterogeneity. FedAD performs neuron-adaptive operations in response to heterogeneous devices to improve accuracy while achieving superb efficiency. The combination of these two methods makes FedDHAD significantly outperform state-of-the-art solutions in terms of accuracy (up to 6.7% higher), efficiency (up to 2.02 times faster), and computation cost (up to 15.0% smaller).

View on arXiv PDF

Similar