IV CVFeb 4, 2025

UD-Mamba: A pixel-level uncertainty-driven Mamba model for medical image segmentation

Weiren Zhao, Feng Wang, Yanran Wang, Yutong Xie, Qi Wu, Yuyin Zhou

arXiv:2502.02024v111.37 citationsh-index: 10Has Code

Originality Incremental advance

AI Analysis

This work addresses segmentation accuracy in medical imaging, which is crucial for diagnosis and treatment planning, though it appears incremental as it builds on the existing Mamba framework with specific modifications.

The paper tackles the challenge of medical image segmentation by addressing Mamba's difficulty in modeling local features and ambiguous boundaries, proposing UD-Mamba which incorporates channel uncertainty into scanning mechanisms to improve precision, achieving robust performance validated across three medical imaging datasets.

Recent advancements have highlighted the Mamba framework, a state-space model known for its efficiency in capturing long-range dependencies with linear computational complexity. While Mamba has shown competitive performance in medical image segmentation, it encounters difficulties in modeling local features due to the sporadic nature of traditional location-based scanning methods and the complex, ambiguous boundaries often present in medical images. To overcome these challenges, we propose Uncertainty-Driven Mamba (UD-Mamba), which redefines the pixel-order scanning process by incorporating channel uncertainty into the scanning mechanism. UD-Mamba introduces two key scanning techniques: 1) sequential scanning, which prioritizes regions with high uncertainty by scanning in a row-by-row fashion, and 2) skip scanning, which processes columns vertically, moving from high-to-low or low-to-high uncertainty at fixed intervals. Sequential scanning efficiently clusters high-uncertainty regions, such as boundaries and foreground objects, to improve segmentation precision, while skip scanning enhances the interaction between background and foreground regions, allowing for timely integration of background information to support more accurate foreground inference. Recognizing the advantages of scanning from certain to uncertain areas, we introduce four learnable parameters to balance the importance of features extracted from different scanning methods. Additionally, a cosine consistency loss is employed to mitigate the drawbacks of transitioning between uncertain and certain regions during the scanning process. Our method demonstrates robust segmentation performance, validated across three distinct medical imaging datasets involving pathology, dermatological lesions, and cardiac tasks.

View on arXiv PDF Code

Similar