LGSep 25, 2025

Distribution-Controlled Client Selection to Improve Federated Learning Strategies

arXiv:2509.20877v11 citationsh-index: 5
Originality Incremental advance
AI Analysis

This work addresses data imbalance issues in federated learning for domains with privacy constraints, but it is incremental as it extends existing FL strategies.

The paper tackled the problem of data imbalance in federated learning by proposing a client selection method that aligns label distributions with target ones, resulting in improved performance for local and global imbalances, with specific gains demonstrated on three FL strategies and two datasets.

Federated learning (FL) is a distributed learning paradigm that allows multiple clients to jointly train a shared model while maintaining data privacy. Despite its great potential for domains with strict data privacy requirements, the presence of data imbalance among clients is a thread to the success of FL, as it causes the performance of the shared model to decrease. To address this, various studies have proposed enhancements to existing FL strategies, particularly through client selection methods that mitigate the detrimental effects of data imbalance. In this paper, we propose an extension to existing FL strategies, which selects active clients that best align the current label distribution with one of two target distributions, namely a balanced distribution or the federations combined label distribution. Subsequently, we empirically verify the improvements through our distribution-controlled client selection on three common FL strategies and two datasets. Our results show that while aligning the label distribution with a balanced distribution yields the greatest improvements facing local imbalance, alignment with the federation's combined label distribution is superior for global imbalance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes