LGAIMar 25, 2023

Federated Learning without Full Labels: A Survey

arXiv:2303.14453v137 citationsh-index: 71
Originality Synthesis-oriented
AI Analysis

This is an incremental survey that identifies and organizes existing solutions for a practical problem in federated learning, relevant for researchers and practitioners dealing with privacy-preserving machine learning on decentralized data.

The paper addresses the challenge of applying federated learning when data is not fully labeled, which is common in practice due to participants lacking expertise or tools for labeling. It surveys methods that integrate federated learning with techniques like semi-supervised, self-supervised, and transfer learning to leverage unlabeled data.

Data privacy has become an increasingly important concern in real-world big data applications such as machine learning. To address the problem, federated learning (FL) has been a promising solution to building effective machine learning models from decentralized and private data. Existing federated learning algorithms mainly tackle the supervised learning problem, where data are assumed to be fully labeled. However, in practice, fully labeled data is often hard to obtain, as the participants may not have sufficient domain expertise, or they lack the motivation and tools to label data. Therefore, the problem of federated learning without full labels is important in real-world FL applications. In this paper, we discuss how the problem can be solved with machine learning techniques that leverage unlabeled data. We present a survey of methods that combine FL with semi-supervised learning, self-supervised learning, and transfer learning methods. We also summarize the datasets used to evaluate FL methods without full labels. Finally, we highlight future directions in the context of FL without full labels.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes