CVAIDec 15, 2024

Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment

arXiv:2412.11186v12 citationsh-index: 2Has CodeMedSAM@CVPR
Originality Synthesis-oriented
AI Analysis

This work addresses the computational efficiency problem for medical practitioners using segmentation models, but it is incremental as it applies existing quantization techniques to a specific model.

The paper tackles the high computational demands of the MedSAM model for medical image segmentation by introducing a quantization-aware training pipeline, which significantly improves processing speed while maintaining acceptable accuracy.

Medical image segmentation is a critical component of clinical practice, and the state-of-the-art MedSAM model has significantly advanced this field. Nevertheless, critiques highlight that MedSAM demands substantial computational resources during inference. To address this issue, the CVPR 2024 MedSAM on Laptop Challenge was established to find an optimal balance between accuracy and processing speed. In this paper, we introduce a quantization-aware training pipeline designed to efficiently quantize the Segment Anything Model for medical images and deploy it using the OpenVINO inference engine. This pipeline optimizes both training time and disk storage. Our experimental results confirm that this approach considerably enhances processing speed over the baseline, while still achieving an acceptable accuracy level. The training script, inference script, and quantized model are publicly accessible at https://github.com/AVC2-UESTC/QMedSAM.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes