CVMar 8, 2022

Weakly Supervised Semantic Segmentation using Out-of-Distribution Data

arXiv:2203.03860v1119 citationsh-index: 38
Originality Highly original
AI Analysis

This addresses a fundamental limitation in weakly supervised semantic segmentation for computer vision applications, though it is incremental as it builds on existing classifier-based methods.

The paper tackles the problem of spurious correlations between foreground and background cues in weakly supervised semantic segmentation by using out-of-distribution data, achieving state-of-the-art performance on Pascal VOC 2012.

Weakly supervised semantic segmentation (WSSS) methods are often built on pixel-level localization maps obtained from a classifier. However, training on class labels only, classifiers suffer from the spurious correlation between foreground and background cues (e.g. train and rail), fundamentally bounding the performance of WSSS. There have been previous endeavors to address this issue with additional supervision. We propose a novel source of information to distinguish foreground from the background: Out-of-Distribution (OoD) data, or images devoid of foreground object classes. In particular, we utilize the hard OoDs that the classifier is likely to make false-positive predictions. These samples typically carry key visual features on the background (e.g. rail) that the classifiers often confuse as foreground (e.g. train), so these cues let classifiers correctly suppress spurious background cues. Acquiring such hard OoDs does not require an extensive amount of annotation efforts; it only incurs a few additional image-level labeling costs on top of the original efforts to collect class labels. We propose a method, W-OoD, for utilizing the hard OoDs. W-OoD achieves state-of-the-art performance on Pascal VOC 2012.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes