IVCVAug 6, 2024

Segment Anything in Medical Images and Videos: Benchmark and Deployment

arXiv:2408.03322v170 citationsh-index: 11Has Code
AI Analysis

This work addresses the unclear utility of segmentation foundation models for medical data, providing benchmarking and deployment tools for researchers and practitioners, though it is incremental as it adapts existing models.

The paper benchmarks the Segment Anything Model 2 (SAM2) on 11 medical image and video modalities, comparing it to SAM1 and MedSAM, and develops a transfer learning pipeline to adapt SAM2 for medical use, with code released as a 3D slicer plugin and Gradio API.

Recent advances in segmentation foundation models have enabled accurate and efficient segmentation across a wide range of natural images and videos, but their utility to medical data remains unclear. In this work, we first present a comprehensive benchmarking of the Segment Anything Model 2 (SAM2) across 11 medical image modalities and videos and point out its strengths and weaknesses by comparing it to SAM1 and MedSAM. Then, we develop a transfer learning pipeline and demonstrate SAM2 can be quickly adapted to medical domain by fine-tuning. Furthermore, we implement SAM2 as a 3D slicer plugin and Gradio API for efficient 3D image and video segmentation. The code has been made publicly available at \url{https://github.com/bowang-lab/MedSAM}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes