CVAIETAug 22, 2024

SAM-SP: Self-Prompting Makes SAM Great Again

arXiv:2408.12364v110 citationsh-index: 9
Originality Incremental advance
AI Analysis

This addresses the practical limitation of requiring expert prompts for domain-specific segmentation, making SAM more applicable in fields like medical imaging, though it is incremental as it builds on existing fine-tuning strategies.

The paper tackles the performance degradation of the Segment Anything Model (SAM) in specific domains like medical images by introducing SAM-SP, a self-prompting fine-tuning approach that uses model outputs as prompts to reduce reliance on expert prompts, achieving superior segmentation performance compared to state-of-the-art methods.

The recently introduced Segment Anything Model (SAM), a Visual Foundation Model (VFM), has demonstrated impressive capabilities in zero-shot segmentation tasks across diverse natural image datasets. Despite its success, SAM encounters noticeably performance degradation when applied to specific domains, such as medical images. Current efforts to address this issue have involved fine-tuning strategies, intended to bolster the generalizability of the vanilla SAM. However, these approaches still predominantly necessitate the utilization of domain specific expert-level prompts during the evaluation phase, which severely constrains the model's practicality. To overcome this limitation, we introduce a novel self-prompting based fine-tuning approach, called SAM-SP, tailored for extending the vanilla SAM model. Specifically, SAM-SP leverages the output from the previous iteration of the model itself as prompts to guide subsequent iteration of the model. This self-prompting module endeavors to learn how to generate useful prompts autonomously and alleviates the dependence on expert prompts during the evaluation phase, significantly broadening SAM's applicability. Additionally, we integrate a self-distillation module to enhance the self-prompting process further. Extensive experiments across various domain specific datasets validate the effectiveness of the proposed SAM-SP. Our SAM-SP not only alleviates the reliance on expert prompts but also exhibits superior segmentation performance comparing to the state-of-the-art task-specific segmentation approaches, the vanilla SAM, and SAM-based approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes