CVSep 10, 2024

Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts

arXiv:2409.06821v111 citationsh-index: 16Has Code
Originality Incremental advance
AI Analysis

This addresses the time-consuming need for expert manual prompts in medical image segmentation, offering an incremental improvement for ultrasound bone analysis.

The paper tackled the problem of adapting foundation models like SAM for medical image segmentation without manual prompts, particularly in ultrasound images, and achieved Dice score improvements of 2-7% for hip/wrist and up to 33% for shoulder data across three datasets.

Foundation models like the segment anything model require high-quality manual prompts for medical image segmentation, which is time-consuming and requires expertise. SAM and its variants often fail to segment structures in ultrasound (US) images due to domain shift. We propose Sam2Rad, a prompt learning approach to adapt SAM and its variants for US bone segmentation without human prompts. It introduces a prompt predictor network (PPN) with a cross-attention module to predict prompt embeddings from image encoder features. PPN outputs bounding box and mask prompts, and 256-dimensional embeddings for regions of interest. The framework allows optional manual prompting and can be trained end-to-end using parameter-efficient fine-tuning (PEFT). Sam2Rad was tested on 3 musculoskeletal US datasets: wrist (3822 images), rotator cuff (1605 images), and hip (4849 images). It improved performance across all datasets without manual prompts, increasing Dice scores by 2-7% for hip/wrist and up to 33% for shoulder data. Sam2Rad can be trained with as few as 10 labeled images and is compatible with any SAM architecture for automatic segmentation.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes