CV AI LGMay 9, 2023

Segment Anything Model (SAM) Enhanced Pseudo Labels for Weakly Supervised Semantic Segmentation

Tianle Chen, Zheda Mai, Ruiwen Li, Wei-lun Chao

arXiv:2305.05803v423.085 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of reducing annotation effort in semantic segmentation for computer vision applications, but it is incremental as it builds on existing methods by integrating SAM.

The paper tackles the problem of inaccurate object boundaries in weakly supervised semantic segmentation by using the Segment Anything Model (SAM) to enhance pseudo-labels, achieving consistent gains over state-of-the-art methods on PASCAL VOC and MS-COCO datasets.

Weakly supervised semantic segmentation (WSSS) aims to bypass the need for laborious pixel-level annotation by using only image-level annotation. Most existing methods rely on Class Activation Maps (CAM) to derive pixel-level pseudo-labels and use them to train a fully supervised semantic segmentation model. Although these pseudo-labels are class-aware, indicating the coarse regions for particular classes, they are not object-aware and fail to delineate accurate object boundaries. To address this, we introduce a simple yet effective method harnessing the Segment Anything Model (SAM), a class-agnostic foundation model capable of producing fine-grained instance masks of objects, parts, and subparts. We use CAM pseudo-labels as cues to select and combine SAM masks, resulting in high-quality pseudo-labels that are both class-aware and object-aware. Our approach is highly versatile and can be easily integrated into existing WSSS methods without any modification. Despite its simplicity, our approach shows consistent gain over the state-of-the-art WSSS methods on both PASCAL VOC and MS-COCO datasets.

View on arXiv PDF Code

Similar