IVCVApr 15, 2025

Lightweight Medical Image Restoration via Integrating Reliable Lesion-Semantic Driven Prior

arXiv:2504.11286v29 citationsh-index: 25MM
Originality Incremental advance
AI Analysis

This work addresses efficiency and reliability issues in medical image restoration for clinical applications, representing an incremental improvement over existing deep learning methods.

The paper tackles the problem of computationally inefficient and unreliable medical image restoration by proposing LRformer, a lightweight Transformer method that integrates reliable lesion-semantic priors and reduces computational complexity via frequency domain processing, achieving superior results in multiple tasks with nearly half the computational cost of naive cross-attention.

Medical image restoration tasks aim to recover high-quality images from degraded observations, exhibiting emergent desires in many clinical scenarios, such as low-dose CT image denoising, MRI super-resolution, and MRI artifact removal. Despite the success achieved by existing deep learning-based restoration methods with sophisticated modules, they struggle with rendering computationally-efficient reconstruction results. Moreover, they usually ignore the reliability of the restoration results, which is much more urgent in medical systems. To alleviate these issues, we present LRformer, a Lightweight Transformer-based method via Reliability-guided learning in the frequency domain. Specifically, inspired by the uncertainty quantification in Bayesian neural networks (BNNs), we develop a Reliable Lesion-Semantic Prior Producer (RLPP). RLPP leverages Monte Carlo (MC) estimators with stochastic sampling operations to generate sufficiently-reliable priors by performing multiple inferences on the foundational medical image segmentation model, MedSAM. Additionally, instead of directly incorporating the priors in the spatial domain, we decompose the cross-attention (CA) mechanism into real symmetric and imaginary anti-symmetric parts via fast Fourier transform (FFT), resulting in the design of the Guided Frequency Cross-Attention (GFCA) solver. By leveraging the conjugated symmetric property of FFT, GFCA reduces the computational complexity of naive CA by nearly half. Extensive experimental results in various tasks demonstrate the superiority of the proposed LRformer in both effectiveness and efficiency.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes