CVJul 4, 2024

Self Adaptive Threshold Pseudo-labeling and Unreliable Sample Contrastive Loss for Semi-supervised Image Classification

Xuerong Zhang, Li Huang, Jing Lv, Ming Yang

arXiv:2407.03596v15 citationsh-index: 3

Originality Incremental advance

AI Analysis

This addresses inefficiencies in semi-supervised learning for image classification, offering incremental improvements in data utilization.

The paper tackles the problems of suboptimal threshold selection and information loss in pseudo-labeling for semi-supervised image classification by proposing a self-adaptive threshold strategy and an unreliable sample contrastive loss, achieving superior performance on benchmarks.

Semi-supervised learning is attracting blooming attention, due to its success in combining unlabeled data. However, pseudo-labeling-based semi-supervised approaches suffer from two problems in image classification: (1) Existing methods might fail to adopt suitable thresholds since they either use a pre-defined/fixed threshold or an ad-hoc threshold adjusting scheme, resulting in inferior performance and slow convergence. (2) Discarding unlabeled data with confidence below the thresholds results in the loss of discriminating information. To solve these issues, we develop an effective method to make sufficient use of unlabeled data. Specifically, we design a self adaptive threshold pseudo-labeling strategy, which thresholds for each class can be dynamically adjusted to increase the number of reliable samples. Meanwhile, in order to effectively utilise unlabeled data with confidence below the thresholds, we propose an unreliable sample contrastive loss to mine the discriminative information in low-confidence samples by learning the similarities and differences between sample features. We evaluate our method on several classification benchmarks under partially labeled settings and demonstrate its superiority over the other approaches.

View on arXiv PDF

Similar