LGFeb 13, 2022

On the Convergence of Clustered Federated Learning

arXiv:2202.06187v262 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of data heterogeneity in federated learning for distributed systems, offering a trade-off between shared and personalized models, but it is incremental as it builds on existing clustered FL methods.

The paper tackles the non-IID data challenge in federated learning by proposing a bi-level optimization framework for clustered federated learning, which clusters similar clients and learns shared models per cluster, with empirical verification showing effectiveness under cluster-wise non-IID settings.

Knowledge sharing and model personalization are essential components to tackle the non-IID challenge in federated learning (FL). Most existing FL methods focus on two extremes: 1) to learn a shared model to serve all clients with non-IID data, and 2) to learn personalized models for each client, namely personalized FL. There is a trade-off solution, namely clustered FL or cluster-wise personalized FL, which aims to cluster similar clients into one cluster, and then learn a shared model for all clients within a cluster. This paper is to revisit the research of clustered FL by formulating them into a bi-level optimization framework that could unify existing methods. We propose a new theoretical analysis framework to prove the convergence by considering the clusterability among clients. In addition, we embody this framework in an algorithm, named Weighted Clustered Federated Learning (WeCFL). Empirical analysis verifies the theoretical results and demonstrates the effectiveness of the proposed WeCFL under the proposed cluster-wise non-IID settings.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes