IV CVAug 6, 2024

Segment Anything in Medical Images and Videos: Benchmark and Deployment

Jun Ma, Sumin Kim, Feifei Li, Mohammed Baharoon, Reza Asakereh, Hongwei Lyu, Bo Wang

arXiv:2408.03322v131.170 citationsh-index: 11Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses the unclear utility of segmentation foundation models for medical data, providing benchmarking and deployment tools for researchers and practitioners, though it is incremental as it adapts existing models.

The paper benchmarks the Segment Anything Model 2 (SAM2) on 11 medical image and video modalities, comparing it to SAM1 and MedSAM, and develops a transfer learning pipeline to adapt SAM2 for medical use, with code released as a 3D slicer plugin and Gradio API.

Recent advances in segmentation foundation models have enabled accurate and efficient segmentation across a wide range of natural images and videos, but their utility to medical data remains unclear. In this work, we first present a comprehensive benchmarking of the Segment Anything Model 2 (SAM2) across 11 medical image modalities and videos and point out its strengths and weaknesses by comparing it to SAM1 and MedSAM. Then, we develop a transfer learning pipeline and demonstrate SAM2 can be quickly adapted to medical domain by fine-tuning. Furthermore, we implement SAM2 as a 3D slicer plugin and Gradio API for efficient 3D image and video segmentation. The code has been made publicly available at \url{https://github.com/bowang-lab/MedSAM}.

View on arXiv PDF Code

Similar