CVAILGMar 22, 2024

Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations

arXiv:2403.15218v13 citationsh-index: 14
Originality Synthesis-oriented
AI Analysis

This addresses the labor-intensive problem of medical image annotation for researchers and clinicians, but it is incremental as it evaluates an existing foundation model in a new setting.

The study investigated using the Segment Anything Model (SAM) to generate medical image annotations from non-experts for training 3D nnU-Net models, finding that while SAM annotations had high Dice scores, models trained on them performed significantly worse than those trained on ground-truth annotations (p<0.001).

Curating annotations for medical image segmentation is a labor-intensive and time-consuming task that requires domain expertise, resulting in "narrowly" focused deep learning (DL) models with limited translational utility. Recently, foundation models like the Segment Anything Model (SAM) have revolutionized semantic segmentation with exceptional zero-shot generalizability across various domains, including medical imaging, and hold a lot of promise for streamlining the annotation process. However, SAM has yet to be evaluated in a crowd-sourced setting to curate annotations for training 3D DL segmentation models. In this work, we explore the potential of SAM for crowd-sourcing "sparse" annotations from non-experts to generate "dense" segmentation masks for training 3D nnU-Net models, a state-of-the-art DL segmentation model. Our results indicate that while SAM-generated annotations exhibit high mean Dice scores compared to ground-truth annotations, nnU-Net models trained on SAM-generated annotations perform significantly worse than nnU-Net models trained on ground-truth annotations ($p<0.001$, all).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes