IVAICVMar 13

Multiscale Structure-Guided Latent Diffusion for Multimodal MRI Translation

arXiv:2603.1258155.4Has Code
AI Analysis

This addresses the challenge of accurate MRI synthesis for medical imaging applications, though it is incremental as it builds on latent diffusion models with specific enhancements.

The paper tackles the problem of anatomical inconsistencies and degraded texture details in multimodal MRI translation under arbitrary missing-modality scenarios, proposing MSG-LDM, which outperforms existing methods by reconstructing complete structures, as demonstrated on BraTS2020 and WMH datasets.

Although diffusion models have achieved remarkable progress in multi-modal magnetic resonance imaging (MRI) translation tasks, existing methods still tend to suffer from anatomical inconsistencies or degraded texture details when handling arbitrary missing-modality scenarios. To address these issues, we propose a latent diffusion-based multi-modal MRI translation framework, termed MSG-LDM. By leveraging the available modalities, the proposed method infers complete structural information, which preserves reliable boundary details. Specifically, we introduce a style--structure disentanglement mechanism in the latent space, which explicitly separates modality-specific style features from shared structural representations, and jointly models low-frequency anatomical layouts and high-frequency boundary details in a multi-scale feature space. During the structure disentanglement stage, high-frequency structural information is explicitly incorporated to enhance feature representations, guiding the model to focus on fine-grained structural cues while learning modality-invariant low-frequency anatomical representations. Furthermore, to reduce interference from modality-specific styles and improve the stability of structure representations, we design a style consistency loss and a structure-aware loss. Extensive experiments on the BraTS2020 and WMH datasets demonstrate that the proposed method outperforms existing MRI synthesis approaches, particularly in reconstructing complete structures. The source code is publicly available at https://github.com/ziyi-start/MSG-LDM.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes