LGAIDCDec 21, 2023

Fed-CO2: Cooperation of Online and Offline Models for Severe Data Heterogeneity in Federated Learning

arXiv:2312.13923v29 citationsh-index: 16NIPS
Originality Incremental advance
AI Analysis

This addresses data heterogeneity challenges in federated learning for distributed clients, but it is incremental as it builds on prior work by unifying two previously separate issues.

The paper tackles the problem of severe data heterogeneity, including both label distribution skew and feature skew, in federated learning by proposing Fed-CO2, a framework that cooperates online and offline models. The result shows that Fed-CO2 outperforms existing personalized federated learning algorithms in handling these issues, as demonstrated through extensive experiments.

Federated Learning (FL) has emerged as a promising distributed learning paradigm that enables multiple clients to learn a global model collaboratively without sharing their private data. However, the effectiveness of FL is highly dependent on the quality of the data that is being used for training. In particular, data heterogeneity issues, such as label distribution skew and feature skew, can significantly impact the performance of FL. Previous studies in FL have primarily focused on addressing label distribution skew data heterogeneity, while only a few recent works have made initial progress in tackling feature skew issues. Notably, these two forms of data heterogeneity have been studied separately and have not been well explored within a unified FL framework. To address this gap, we propose Fed-CO$_{2}$, a universal FL framework that handles both label distribution skew and feature skew within a \textbf{C}ooperation mechanism between the \textbf{O}nline and \textbf{O}ffline models. Specifically, the online model learns general knowledge that is shared among all clients, while the offline model is trained locally to learn the specialized knowledge of each individual client. To further enhance model cooperation in the presence of feature shifts, we design an intra-client knowledge transfer mechanism that reinforces mutual learning between the online and offline models, and an inter-client knowledge transfer mechanism to increase the models' domain generalization ability. Extensive experiments show that our Fed-CO$_{2}$ outperforms a wide range of existing personalized federated learning algorithms in terms of handling label distribution skew and feature skew, both individually and collectively. The empirical results are supported by our convergence analyses in a simplified setting.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes