CVAug 6, 2024

Biomedical SAM 2: Segment Anything in Biomedical Images and Videos

Zhiling Yan, Weixiang Sun, Rong Zhou, Zhengqing Yuan, Kai Zhang, Yiwei Li, Tianming Liu, Quanzheng Li, Xiang Li, Lifang He, Lichao Sun

arXiv:2408.03286v216.822 citationsh-index: 34Has Code

Originality Synthesis-oriented

AI Analysis

This addresses the need for accurate segmentation in medical diagnostics, representing an incremental adaptation of a foundation model to a specific domain.

The researchers tackled the problem of adapting the Segment Anything Model 2 (SAM-2) for biomedical image and video segmentation, revealing its limitations and developing BioSAM-2, which surpasses existing foundation models and matches or exceeds specialist models in performance.

Medical image segmentation and video object segmentation are essential for diagnosing and analyzing diseases by identifying and measuring biological structures. Recent advances in natural domain have been driven by foundation models like the Segment Anything Model 2 (SAM-2). To explore the performance of SAM-2 in biomedical applications, we designed three evaluation pipelines for single-frame 2D image segmentation, multi-frame 3D image segmentation and multi-frame video segmentation with varied prompt designs, revealing SAM-2's limitations in medical contexts. Consequently, we developed BioSAM-2, an enhanced foundation model optimized for biomedical data based on SAM-2. Our experiments show that BioSAM-2 not only surpasses the performance of existing state-of-the-art foundation models but also matches or even exceeds specialist models, demonstrating its efficacy and potential in the medical domain.

View on arXiv PDF Code

Similar