LGNov 14, 2025

CATCHFed: Efficient Unlabeled Data Utilization for Semi-Supervised Federated Learning in Limited Labels Environments

Byoungjun Park, Pedro Porto Buarque de Gusmão, Dongjin Ji, Minhoe Kim

arXiv:2511.11778v24.11 citationsh-index: 1

Originality Incremental advance

AI Analysis

This addresses the challenge of label scarcity in federated learning for real-world applications, though it appears incremental as it builds on existing semi-supervised FL methods.

The paper tackles the problem of performance degradation in semi-supervised federated learning when labeled data is scarce, proposing CATCHFed to improve pseudo-label quality and utilize unlabeled data, achieving superior performance in limited-label settings.

Federated learning is a promising paradigm that utilizes distributed client resources while preserving data privacy. Most existing FL approaches assume clients possess labeled data, however, in real-world scenarios, client-side labels are often unavailable. Semi-supervised Federated learning, where only the server holds labeled data, addresses this issue. However, it experiences significant performance degradation as the number of labeled data decreases. To tackle this problem, we propose \textit{CATCHFed}, which introduces client-aware adaptive thresholds considering class difficulty, hybrid thresholds to enhance pseudo-label quality, and utilizes unpseudo-labeled data for consistency regularization. Extensive experiments across various datasets and configurations demonstrate that CATCHFed effectively leverages unlabeled client data, achieving superior performance even in extremely limited-label settings.

View on arXiv PDF

Similar