LGCVMar 22, 2024

Do not trust what you trust: Miscalibration in Semi-supervised Learning

arXiv:2403.15567v13 citationsh-index: 50Trans. Mach. Learn. Res.
Originality Incremental advance
AI Analysis

This addresses the issue of overconfidence in pseudo-labeling for SSL, which is crucial for improving reliability in image classification tasks, though it is an incremental improvement.

The paper tackled the problem of miscalibrated uncertainty estimates in semi-supervised learning (SSL) methods that use pseudo-labels, showing they are significantly miscalibrated due to minimizing min-entropy, and proposed a simple penalty term that improved calibration performance and discriminative power across SSL benchmarks.

State-of-the-art semi-supervised learning (SSL) approaches rely on highly confident predictions to serve as pseudo-labels that guide the training on unlabeled samples. An inherent drawback of this strategy stems from the quality of the uncertainty estimates, as pseudo-labels are filtered only based on their degree of uncertainty, regardless of the correctness of their predictions. Thus, assessing and enhancing the uncertainty of network predictions is of paramount importance in the pseudo-labeling process. In this work, we empirically demonstrate that SSL methods based on pseudo-labels are significantly miscalibrated, and formally demonstrate the minimization of the min-entropy, a lower bound of the Shannon entropy, as a potential cause for miscalibration. To alleviate this issue, we integrate a simple penalty term, which enforces the logit distances of the predictions on unlabeled samples to remain low, preventing the network predictions to become overconfident. Comprehensive experiments on a variety of SSL image classification benchmarks demonstrate that the proposed solution systematically improves the calibration performance of relevant SSL models, while also enhancing their discriminative power, being an appealing addition to tackle SSL tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes