IVAICVFeb 26, 2025

Multi-modal Contrastive Learning for Tumor-specific Missing Modality Synthesis

arXiv:2502.19390v2h-index: 14
Originality Incremental advance
AI Analysis

This work addresses the challenge of incomplete multi-modal MRI data in clinical settings for brain tumor diagnosis, representing an incremental improvement in generative models for medical imaging.

The paper tackled the problem of synthesizing missing MRI modalities in brain tumor imaging by integrating multi-modal contrastive learning with entropy-based feature selection and joint segmentation, resulting in improved performance in generating high-quality target images as demonstrated in the Brain MR Image Synthesis challenge.

Multi-modal magnetic resonance imaging (MRI) is essential for providing complementary information about brain anatomy and pathology, leading to more accurate diagnoses. However, obtaining high-quality multi-modal MRI in a clinical setting is difficult due to factors such as time constraints, high costs, and patient movement artifacts. To overcome this difficulty, there is increasing interest in developing generative models that can synthesize missing target modality images from the available source ones. Therefore, our team, PLAVE, design a generative model for missing MRI that integrates multi-modal contrastive learning with a focus on critical tumor regions. Specifically, we integrate multi-modal contrastive learning, tailored for multiple source modalities, and enhance its effectiveness by selecting features based on entropy during the contrastive learning process. Additionally, our network not only generates the missing target modality images but also predicts segmentation outputs, simultaneously. This approach improves the generator's capability to precisely generate tumor regions, ultimately improving performance in downstream segmentation tasks. By leveraging a combination of contrastive, segmentation, and additional self-representation losses, our model effectively reflects target-specific information and generate high-quality target images. Consequently, our results in the Brain MR Image Synthesis challenge demonstrate that the proposed model excelled in generating the missing modality.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes