CV · Jun 20, 2025

Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations

Dongdong Meng, Sheng Li, Hao Wu, Guoping Wang, Xueqing Yan

arXiv:2506.17136v1 · 2 citations · h-index: 15 · MICCAI

Originality: Incremental advance

AI Analysis

This paper addresses the challenge of effective multi-modal learning under semi-supervised conditions for medical image segmentation in complex scenarios. It represents an incremental improvement.

The paper tackles the problem of limited annotations in medical image segmentation for complex backgrounds by proposing a semi-supervised multi-modal approach that leverages complementary information across modalities, achieving superior performance and robustness on two multi-modal datasets.

Semi-supervised learning addresses the issue of limited annotations in medical images effectively, but its performance is often inadequate for complex backgrounds and challenging tasks. Multi-modal fusion methods can significantly improve the accuracy of medical image segmentation by providing complementary information. However, they face challenges in achieving significant improvements under semi-supervised conditions due to the challenge of effectively leveraging unlabeled data. There is a significant need to create an effective and reliable multi-modal learning strategy for leveraging unlabeled data in semi-supervised segmentation. To address these issues, we propose a novel semi-supervised multi-modal medical image segmentation approach, which leverages complementary multi-modal information to enhance performance with limited labeled data. Our approach employs a multi-stage multi-modal fusion and enhancement strategy to fully utilize complementary multi-modal information, while reducing feature discrepancies and enhancing feature sharing and alignment. Furthermore, we effectively introduce contrastive mutual learning to constrain prediction consistency across modalities, thereby facilitating the robustness of segmentation results in semi-supervised tasks. Experimental results on two multi-modal datasets demonstrate the superior performance and robustness of the proposed framework, establishing its valuable potential for solving medical image segmentation tasks in complex scenarios.
