CVJan 2, 2025

Generalized Task-Driven Medical Image Quality Enhancement with Gradient Promotion

arXiv:2501.01114v16 citationsh-index: 12IEEE Trans Pattern Anal Mach Intell
Originality Incremental advance
AI Analysis

This work addresses image quality enhancement for medical imaging, offering a novel training approach that is incremental in improving existing task-driven models.

The paper tackles the problem of task-driven medical image quality enhancement by addressing conflicting feature requirements across different vision tasks, proposing a gradient promotion training strategy that improves performance over state-of-the-art methods on four public datasets.

Thanks to the recent achievements in task-driven image quality enhancement (IQE) models like ESTR, the image enhancement model and the visual recognition model can mutually enhance each other's quantitation while producing high-quality processed images that are perceivable by our human vision systems. However, existing task-driven IQE models tend to overlook an underlying fact -- different levels of vision tasks have varying and sometimes conflicting requirements of image features. To address this problem, this paper proposes a generalized gradient promotion (GradProm) training strategy for task-driven IQE of medical images. Specifically, we partition a task-driven IQE system into two sub-models, i.e., a mainstream model for image enhancement and an auxiliary model for visual recognition. During training, GradProm updates only parameters of the image enhancement model using gradients of the visual recognition model and the image enhancement model, but only when gradients of these two sub-models are aligned in the same direction, which is measured by their cosine similarity. In case gradients of these two sub-models are not in the same direction, GradProm only uses the gradient of the image enhancement model to update its parameters. Theoretically, we have proved that the optimization direction of the image enhancement model will not be biased by the auxiliary visual recognition model under the implementation of GradProm. Empirically, extensive experimental results on four public yet challenging medical image datasets demonstrated the superior performance of GradProm over existing state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes