CV AI LGJun 10, 2021

DASO: Distribution-Aware Semantics-Oriented Pseudo-label for Imbalanced Semi-Supervised Learning

arXiv:2106.05682v222.288 citationsHas Code

Originality Incremental advance

AI Analysis

It addresses a relatively under-explored issue in semi-supervised learning for real-world applications with imbalanced data, representing an incremental advancement.

The paper tackles the problem of biased pseudo-labels in semi-supervised learning due to class imbalance and distribution mismatch, proposing the DASO framework that improves SSL learners by blending semantic and linear pseudo-labels and using a semantic alignment loss, achieving reliable improvements across multiple imbalanced benchmarks.

The capability of the traditional semi-supervised learning (SSL) methods is far from real-world application due to severely biased pseudo-labels caused by (1) class imbalance and (2) class distribution mismatch between labeled and unlabeled data. This paper addresses such a relatively under-explored problem. First, we propose a general pseudo-labeling framework that class-adaptively blends the semantic pseudo-label from a similarity-based classifier to the linear one from the linear classifier, after making the observation that both types of pseudo-labels have complementary properties in terms of bias. We further introduce a novel semantic alignment loss to establish balanced feature representation to reduce the biased predictions from the classifier. We term the whole framework as Distribution-Aware Semantics-Oriented (DASO) Pseudo-label. We conduct extensive experiments in a wide range of imbalanced benchmarks: CIFAR10/100-LT, STL10-LT, and large-scale long-tailed Semi-Aves with open-set class, and demonstrate that, the proposed DASO framework reliably improves SSL learners with unlabeled data especially when both (1) class imbalance and (2) distribution mismatch dominate.

View on arXiv PDF Code

Similar