LGJan 18, 2022

Towards Federated Clustering: A Federated Fuzzy $c$-Means Algorithm (FFCM)

arXiv:2201.07316v153 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of clustering data across distributed parties without sharing data, which is incremental as it adapts an existing algorithm to federated learning.

The paper tackles federated clustering by extending the fuzzy c-means algorithm to a federated setting (FFCM), proposing two methods for calculating global cluster centers and evaluating them through numerical experiments, with one method showing good performance in challenging scenarios.

Federated Learning (FL) is a setting where multiple parties with distributed data collaborate in training a joint Machine Learning (ML) model while keeping all data local at the parties. Federated clustering is an area of research within FL that is concerned with grouping together data that is globally similar while keeping all data local. We describe how this area of research can be of interest in itself, or how it helps addressing issues like non-independently-identically-distributed (i.i.d.) data in supervised FL frameworks. The focus of this work, however, is an extension of the federated fuzzy $c$-means algorithm to the FL setting (FFCM) as a contribution towards federated clustering. We propose two methods to calculate global cluster centers and evaluate their behaviour through challenging numerical experiments. We observe that one of the methods is able to identify good global clusters even in challenging scenarios, but also acknowledge that many challenges remain open.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes