CVMay 23, 2022

Decoder Denoising Pretraining for Semantic Segmentation

arXiv:2205.11423v128 citationsh-index: 71
Originality Incremental advance
AI Analysis

This addresses label efficiency for semantic segmentation tasks, offering incremental improvements over existing pretraining methods.

The paper tackles the problem of expensive semantic segmentation labels by proposing decoder denoising pretraining, which outperforms encoder-only supervised pretraining and achieves state-of-the-art results on label-efficient segmentation with gains on Cityscapes, Pascal Context, and ADE20K datasets.

Semantic segmentation labels are expensive and time consuming to acquire. Hence, pretraining is commonly used to improve the label-efficiency of segmentation models. Typically, the encoder of a segmentation model is pretrained as a classifier and the decoder is randomly initialized. Here, we argue that random initialization of the decoder can be suboptimal, especially when few labeled examples are available. We propose a decoder pretraining approach based on denoising, which can be combined with supervised pretraining of the encoder. We find that decoder denoising pretraining on the ImageNet dataset strongly outperforms encoder-only supervised pretraining. Despite its simplicity, decoder denoising pretraining achieves state-of-the-art results on label-efficient semantic segmentation and offers considerable gains on the Cityscapes, Pascal Context, and ADE20K datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes