LGAIJan 25, 2023

Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning

Peking UTsinghua
arXiv:2301.10394v28 citationsh-index: 49
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving model fairness and accuracy in federated learning systems with imbalanced data distributions, which is crucial for real-world applications like healthcare or finance, but it is incremental as it builds on decoupled training approaches.

The paper tackles the problem of federated learning on long-tailed data, where the global model becomes biased towards head classes, by integrating local real data with global gradient prototypes to form balanced datasets and re-balancing the classifier locally, resulting in consistent outperformance over state-of-the-art methods in various settings.

Federated Learning (FL) has become a popular distributed learning paradigm that involves multiple clients training a global model collaboratively in a data privacy-preserving manner. However, the data samples usually follow a long-tailed distribution in the real world, and FL on the decentralized and long-tailed data yields a poorly-behaved global model severely biased to the head classes with the majority of the training samples. To alleviate this issue, decoupled training has recently been introduced to FL, considering it has achieved promising results in centralized long-tailed learning by re-balancing the biased classifier after the instance-balanced training. However, the current study restricts the capacity of decoupled training in federated long-tailed learning with a sub-optimal classifier re-trained on a set of pseudo features, due to the unavailability of a global balanced dataset in FL. In this work, in order to re-balance the classifier more effectively, we integrate the local real data with the global gradient prototypes to form the local balanced datasets, and thus re-balance the classifier during the local training. Furthermore, we introduce an extra classifier in the training phase to help model the global data distribution, which addresses the problem of contradictory optimization goals caused by performing classifier re-balancing locally. Extensive experiments show that our method consistently outperforms the existing state-of-the-art methods in various settings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes