LG AIMar 25, 2023

Federated Learning without Full Labels: A Survey

Yilun Jin, Yang Liu, Kai Chen, Qiang Yang

arXiv:2303.14453v113.737 citationsh-index: 71

Originality Synthesis-oriented

AI Analysis

This is an incremental survey that identifies and organizes existing solutions for a practical problem in federated learning, relevant for researchers and practitioners dealing with privacy-preserving machine learning on decentralized data.

The paper addresses the challenge of applying federated learning when data is not fully labeled, which is common in practice due to participants lacking expertise or tools for labeling. It surveys methods that integrate federated learning with techniques like semi-supervised, self-supervised, and transfer learning to leverage unlabeled data.

Data privacy has become an increasingly important concern in real-world big data applications such as machine learning. To address the problem, federated learning (FL) has been a promising solution to building effective machine learning models from decentralized and private data. Existing federated learning algorithms mainly tackle the supervised learning problem, where data are assumed to be fully labeled. However, in practice, fully labeled data is often hard to obtain, as the participants may not have sufficient domain expertise, or they lack the motivation and tools to label data. Therefore, the problem of federated learning without full labels is important in real-world FL applications. In this paper, we discuss how the problem can be solved with machine learning techniques that leverage unlabeled data. We present a survey of methods that combine FL with semi-supervised learning, self-supervised learning, and transfer learning methods. We also summarize the datasets used to evaluate FL methods without full labels. Finally, we highlight future directions in the context of FL without full labels.

View on arXiv PDF

Similar