LGCVMay 15, 2022

FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning

CMUPeking U
arXiv:2205.07246v3449 citationsh-index: 58Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of utilizing unlabeled data more effectively in semi-supervised learning, especially with limited labeled data, though it is incremental as it builds on existing pseudo-labeling methods.

The paper tackles the problem of ineffective threshold adjustment in semi-supervised learning by proposing FreeMatch, a method that self-adaptively adjusts confidence thresholds based on the model's learning status, resulting in error rate reductions of 5.78%, 13.59%, and 1.28% over the state-of-the-art on various datasets.

Semi-supervised Learning (SSL) has witnessed great success owing to the impressive performances brought by various methods based on pseudo labeling and consistency regularization. However, we argue that existing methods might fail to utilize the unlabeled data more effectively since they either use a pre-defined / fixed threshold or an ad-hoc threshold adjusting scheme, resulting in inferior performance and slow convergence. We first analyze a motivating example to obtain intuitions on the relationship between the desirable threshold and model's learning status. Based on the analysis, we hence propose FreeMatch to adjust the confidence threshold in a self-adaptive manner according to the model's learning status. We further introduce a self-adaptive class fairness regularization penalty to encourage the model for diverse predictions during the early training stage. Extensive experiments indicate the superiority of FreeMatch especially when the labeled data are extremely rare. FreeMatch achieves 5.78%, 13.59%, and 1.28% error rate reduction over the latest state-of-the-art method FlexMatch on CIFAR-10 with 1 label per class, STL-10 with 4 labels per class, and ImageNet with 100 labels per class, respectively. Moreover, FreeMatch can also boost the performance of imbalanced SSL. The codes can be found at https://github.com/microsoft/Semi-supervised-learning.

Code Implementations6 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes