IVAICVJul 22, 2025

Pyramid Hierarchical Masked Diffusion Model for Imaging Synthesis

arXiv:2507.16579v1h-index: 3Has CodeIJCNN
Originality Incremental advance
AI Analysis

This work addresses a critical issue in clinical workflows by enabling more efficient and accurate medical image synthesis, though it is incremental as it builds on existing diffusion models with multi-scale hierarchical improvements.

The paper tackles the problem of missing medical imaging modalities by proposing the Pyramid Hierarchical Masked Diffusion Model (PHMDiff), which achieves superior performance in PSNR and SSIM on two challenging datasets, producing high-quality synthesized images with excellent structural integrity.

Medical image synthesis plays a crucial role in clinical workflows, addressing the common issue of missing imaging modalities due to factors such as extended scan times, scan corruption, artifacts, patient motion, and intolerance to contrast agents. The paper presents a novel image synthesis network, the Pyramid Hierarchical Masked Diffusion Model (PHMDiff), which employs a multi-scale hierarchical approach for more detailed control over synthesizing high-quality images across different resolutions and layers. Specifically, this model utilizes randomly multi-scale high-proportion masks to speed up diffusion model training, and balances detail fidelity and overall structure. The integration of a Transformer-based Diffusion model process incorporates cross-granularity regularization, modeling the mutual information consistency across each granularity's latent spaces, thereby enhancing pixel-level perceptual accuracy. Comprehensive experiments on two challenging datasets demonstrate that PHMDiff achieves superior performance in both the Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index Measure (SSIM), highlighting its capability to produce high-quality synthesized images with excellent structural integrity. Ablation studies further confirm the contributions of each component. Furthermore, the PHMDiff model, a multi-scale image synthesis framework across and within medical imaging modalities, shows significant advantages over other methods. The source code is available at https://github.com/xiaojiao929/PHMDiff

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes