LGSep 25, 2025

Distribution-Controlled Client Selection to Improve Federated Learning Strategies

arXiv:2509.20877v14.11 citationsh-index: 5

Originality Incremental advance

AI Analysis

This work addresses data imbalance issues in federated learning for domains with privacy constraints, but it is incremental as it extends existing FL strategies.

The paper tackled the problem of data imbalance in federated learning by proposing a client selection method that aligns label distributions with target ones, resulting in improved performance for local and global imbalances, with specific gains demonstrated on three FL strategies and two datasets.

Federated learning (FL) is a distributed learning paradigm that allows multiple clients to jointly train a shared model while maintaining data privacy. Despite its great potential for domains with strict data privacy requirements, the presence of data imbalance among clients is a thread to the success of FL, as it causes the performance of the shared model to decrease. To address this, various studies have proposed enhancements to existing FL strategies, particularly through client selection methods that mitigate the detrimental effects of data imbalance. In this paper, we propose an extension to existing FL strategies, which selects active clients that best align the current label distribution with one of two target distributions, namely a balanced distribution or the federations combined label distribution. Subsequently, we empirically verify the improvements through our distribution-controlled client selection on three common FL strategies and two datasets. Our results show that while aligning the label distribution with a balanced distribution yields the greatest improvements facing local imbalance, alignment with the federation's combined label distribution is superior for global imbalance.

View on arXiv PDF

Similar