CVJan 18, 2022

MuSCLe: A Multi-Strategy Contrastive Learning Framework for Weakly Supervised Semantic Segmentation

arXiv:2201.07021v1
Originality Incremental advance
AI Analysis

This work addresses the challenge of reducing annotation costs in semantic segmentation for computer vision applications, though it appears incremental as it builds on existing contrastive learning methods.

The paper tackles the problem of weakly supervised semantic segmentation (WSSS) by proposing a multi-strategy contrastive learning framework to enhance feature representations, resulting in state-of-the-art performance on the PASCAL VOC 2012 dataset.

Weakly supervised semantic segmentation (WSSS) has gained significant popularity since it relies only on weak labels such as image level annotations rather than pixel level annotations required by supervised semantic segmentation (SSS) methods. Despite drastically reduced annotation costs, typical feature representations learned from WSSS are only representative of some salient parts of objects and less reliable compared to SSS due to the weak guidance during training. In this paper, we propose a novel Multi-Strategy Contrastive Learning (MuSCLe) framework to obtain enhanced feature representations and improve WSSS performance by exploiting similarity and dissimilarity of contrastive sample pairs at image, region, pixel and object boundary levels. Extensive experiments demonstrate the effectiveness of our method and show that MuSCLe outperforms the current state-of-the-art on the widely used PASCAL VOC 2012 dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes