Self-Prompt SAM: Medical Image Segmentation via Automatic Prompt SAM Adaptation
This addresses the problem of applying SAM to medical images for researchers and practitioners, offering an incremental adaptation with automated prompts and 3D integration.
The paper tackles the challenge of adapting the Segment Anything Model (SAM) to medical image segmentation by proposing Self-Prompt-SAM, which automatically generates prompts and integrates 3D information, achieving state-of-the-art performance with improvements of 2.3% on AMOS2022, 1.6% on ACDC, and 0.5% on Synapse datasets.
Segment Anything Model (SAM) has demonstrated impressive zero-shot performance and brought a range of unexplored capabilities to natural image segmentation tasks. However, as a very important branch of image segmentation, the performance of SAM remains uncertain when applied to medical image segmentation due to the significant differences between natural images and medical images. Meanwhile, it is harsh to meet the SAM's requirements of extra prompts provided, such as points or boxes to specify medical regions. In this paper, we propose a novel self-prompt SAM adaptation framework for medical image segmentation, named Self-Prompt-SAM. We design a multi-scale prompt generator combined with the image encoder in SAM to generate auxiliary masks. Then, we use the auxiliary masks to generate bounding boxes as box prompts and use Distance Transform to select the most central points as point prompts. Meanwhile, we design a 3D depth-fused adapter (DfusedAdapter) and inject the DFusedAdapter into each transformer in the image encoder and mask decoder to enable pre-trained 2D SAM models to extract 3D information and adapt to 3D medical images. Extensive experiments demonstrate that our method achieves state-of-the-art performance and outperforms nnUNet by 2.3% on AMOS2022, 1.6% on ACDCand 0.5% on Synapse datasets.