LGAug 4, 2024

Personalized Federated Learning on Heterogeneous and Long-Tailed Data via Expert Collaborative Learning

arXiv:2408.02019v11 citationsh-index: 12
Originality Incremental advance
AI Analysis

This addresses performance degradation in federated learning for domains like healthcare where data is heterogeneous and long-tailed, representing an incremental improvement.

The paper tackles the joint problem of global long-tailed distribution and data heterogeneity in Personalized Federated Learning by proposing Expert Collaborative Learning, which outperforms state-of-the-art methods on benchmark datasets.

Personalized Federated Learning (PFL) aims to acquire customized models for each client without disclosing raw data by leveraging the collective knowledge of distributed clients. However, the data collected in real-world scenarios is likely to follow a long-tailed distribution. For example, in the medical domain, it is more common for the number of general health notes to be much larger than those specifically relatedto certain diseases. The presence of long-tailed data can significantly degrade the performance of PFL models. Additionally, due to the diverse environments in which each client operates, data heterogeneity is also a classic challenge in federated learning. In this paper, we explore the joint problem of global long-tailed distribution and data heterogeneity in PFL and propose a method called Expert Collaborative Learning (ECL) to tackle this problem. Specifically, each client has multiple experts, and each expert has a different training subset, which ensures that each class, especially the minority classes, receives sufficient training. Multiple experts collaborate synergistically to produce the final prediction output. Without special bells and whistles, the vanilla ECL outperforms other state-of-the-art PFL methods on several benchmark datasets under different degrees of data heterogeneity and long-tailed distribution.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes