IVCVNov 26, 2024

cWDM: Conditional Wavelet Diffusion Models for Cross-Modality 3D Medical Image Synthesis

arXiv:2411.17203v114 citationsh-index: 11Has Code
Originality Incremental advance
AI Analysis

This work addresses a specific clinical bottleneck in medical imaging by providing a method to handle missing modalities, though it is incremental as it adapts existing diffusion models to a new application.

The paper tackles the problem of missing MRI modalities in brain tumor segmentation by proposing a conditional Wavelet Diffusion Model (cWDM) to synthesize missing images from available ones, enabling downstream segmentation models to function with incomplete data.

This paper contributes to the "BraTS 2024 Brain MR Image Synthesis Challenge" and presents a conditional Wavelet Diffusion Model (cWDM) for directly solving a paired image-to-image translation task on high-resolution volumes. While deep learning-based brain tumor segmentation models have demonstrated clear clinical utility, they typically require MR scans from various modalities (T1, T1ce, T2, FLAIR) as input. However, due to time constraints or imaging artifacts, some of these modalities may be missing, hindering the application of well-performing segmentation algorithms in clinical routine. To address this issue, we propose a method that synthesizes one missing modality image conditioned on three available images, enabling the application of downstream segmentation models. We treat this paired image-to-image translation task as a conditional generation problem and solve it by combining a Wavelet Diffusion Model for high-resolution 3D image synthesis with a simple conditioning strategy. This approach allows us to directly apply our model to full-resolution volumes, avoiding artifacts caused by slice- or patch-wise data processing. While this work focuses on a specific application, the presented method can be applied to all kinds of paired image-to-image translation problems, such as CT $\leftrightarrow$ MR and MR $\leftrightarrow$ PET translation, or mask-conditioned anatomically guided image generation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes