LG CV APP-PHOct 10, 2022

On the Importance of Calibration in Semi-supervised Learning

Charlotte Loh, Rumen Dangovski, Shivchander Sudalairaj, Seungwook Han, Ligong Han, Leonid Karlinsky, Marin Soljacic, Akash Srivastava

arXiv:2210.04783v18.77 citationsh-index: 98

Originality Incremental advance

AI Analysis

This addresses the issue of confirmation bias in semi-supervised learning for practitioners in vision and other domains, though it is incremental as it builds on existing methods.

The paper tackled the problem of model calibration in semi-supervised learning, showing it is strongly correlated with performance and proposing new models that improve test accuracy by up to 15.9% on standard benchmarks.

State-of-the-art (SOTA) semi-supervised learning (SSL) methods have been highly successful in leveraging a mix of labeled and unlabeled data by combining techniques of consistency regularization and pseudo-labeling. During pseudo-labeling, the model's predictions on unlabeled data are used for training and thus, model calibration is important in mitigating confirmation bias. Yet, many SOTA methods are optimized for model performance, with little focus directed to improve model calibration. In this work, we empirically demonstrate that model calibration is strongly correlated with model performance and propose to improve calibration via approximate Bayesian techniques. We introduce a family of new SSL models that optimizes for calibration and demonstrate their effectiveness across standard vision benchmarks of CIFAR-10, CIFAR-100 and ImageNet, giving up to 15.9% improvement in test accuracy. Furthermore, we also demonstrate their effectiveness in additional realistic and challenging problems, such as class-imbalanced datasets and in photonics science.

View on arXiv PDF

Similar