CVSep 21, 2024

Improving 3D Semi-supervised Learning by Effectively Utilizing All Unlabelled Data

Sneha Paul, Zachary Patterson, Nizar Bouguila

arXiv:2409.13977v13.71 citationsh-index: 15Has Code

Originality Incremental advance

AI Analysis

This addresses the challenge of limited labeled data in 3D classification for computer vision applications, representing a strong specific gain rather than a broad paradigm shift.

The paper tackles the problem of underutilizing unlabeled data in 3D semi-supervised learning by proposing AllMatch, a framework with adaptive augmentation, inverse learning, and contrastive modules, achieving up to 11.2% performance improvement with 1% labeled data and matching full supervision with only 10% labeled data.

Semi-supervised learning (SSL) has shown its effectiveness in learning effective 3D representation from a small amount of labelled data while utilizing large unlabelled data. Traditional semi-supervised approaches rely on the fundamental concept of predicting pseudo-labels for unlabelled data and incorporating them into the learning process. However, we identify that the existing methods do not fully utilize all the unlabelled samples and consequently limit their potential performance. To address this issue, we propose AllMatch, a novel SSL-based 3D classification framework that effectively utilizes all the unlabelled samples. AllMatch comprises three modules: (1) an adaptive hard augmentation module that applies relatively hard augmentations to the high-confident unlabelled samples with lower loss values, thereby enhancing the contribution of such samples, (2) an inverse learning module that further improves the utilization of unlabelled data by learning what not to learn, and (3) a contrastive learning module that ensures learning from all the samples in both supervised and unsupervised settings. Comprehensive experiments on two popular 3D datasets demonstrate a performance improvement of up to 11.2% with 1% labelled data, surpassing the SOTA by a significant margin. Furthermore, AllMatch exhibits its efficiency in effectively leveraging all the unlabelled data, demonstrated by the fact that only 10% of labelled data reaches nearly the same performance as fully-supervised learning with all labelled data. The code of our work is available at: https://github.com/snehaputul/AllMatch.

View on arXiv PDF Code

Similar