CVSep 7, 2024

SSFam: Scribble Supervised Salient Object Detection Family

arXiv:2409.04817v110 citationsh-index: 11Has Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of efficient salient object detection in complex scenes using sparse scribble labels, offering a unified approach for multiple modalities, though it is incremental by building on SAM.

The paper tackles the problem of scribble-supervised salient object detection across multiple image modalities (RGB, depth, thermal) by proposing SSFam, a family of models based on the Segment Anything Model (SAM), which achieves state-of-the-art performance and approaches fully supervised methods.

Scribble supervised salient object detection (SSSOD) constructs segmentation ability of attractive objects from surroundings under the supervision of sparse scribble labels. For the better segmentation, depth and thermal infrared modalities serve as the supplement to RGB images in the complex scenes. Existing methods specifically design various feature extraction and multi-modal fusion strategies for RGB, RGB-Depth, RGB-Thermal, and Visual-Depth-Thermal image input respectively, leading to similar model flood. As the recently proposed Segment Anything Model (SAM) possesses extraordinary segmentation and prompt interactive capability, we propose an SSSOD family based on SAM, named SSFam, for the combination input with different modalities. Firstly, different modal-aware modulators are designed to attain modal-specific knowledge which cooperates with modal-agnostic information extracted from the frozen SAM encoder for the better feature ensemble. Secondly, a siamese decoder is tailored to bridge the gap between the training with scribble prompt and the testing with no prompt for the stronger decoding ability. Our model demonstrates the remarkable performance among combinations of different modalities and refreshes the highest level of scribble supervised methods and comes close to the ones of fully supervised methods. https://github.com/liuzywen/SSFam

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes