CV · Apr 11

Radiology Report Generation for Low-Quality X-Ray Images

Hongze Zhu, Chen Hu, Jiaxuan Jiang, Hong Liu, Yawen Huang, Ming Hu, Tianyu Wang, Zhijian Wu, Yefeng Zheng

arXiv:2604.10188 · 73.9 · h-index: 8

Predicted impact: top 37% in CV · last 90 days

Originality: Incremental advance

AI Analysis

For clinical radiology, this work addresses variation in image quality, a practical obstacle to reliable automated reporting in real-world settings.

The paper addresses the performance degradation of radiology report generation models when the input X-ray images are low-quality. It proposes a robust framework whose dual-loop training strategy learns quality-agnostic diagnostic features, mitigating that degradation.

Vision-Language Models (VLMs) have significantly advanced automated Radiology Report Generation (RRG). However, existing methods implicitly assume high-quality inputs, overlooking the noise and artifacts prevalent in real-world clinical environments. Consequently, current models exhibit severe performance degradation when processing suboptimal images. To bridge this gap, we propose a robust report generation framework explicitly designed for image quality variations. We first introduce an Automated Quality Assessment Agent (AQAA) to identify low-quality samples within the MIMIC-CXR dataset and establish the Low-quality Radiology Report Generation (LRRG) benchmark. To tackle degradation-induced shifts, we propose a novel Dual-loop Training Strategy leveraging bi-level optimization and gradient consistency. This approach ensures the model learns quality-agnostic diagnostic features by aligning gradient directions across varying quality regimes. Extensive experiments demonstrate that our approach effectively mitigates model performance degradation caused by image quality deterioration. The code and data will be released upon acceptance.
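The abstract gives no equations for the Dual-loop Training Strategy, so the following is only a toy sketch of the general idea of a two-stage update with a gradient-consistency signal, not the authors' implementation. It assumes a linear regression model in place of a VLM, a first-order approximation in place of full bi-level optimization, and cosine similarity as the consistency measure. All names (`dual_loop_step`, `make_toy_data`, the learning rates) are hypothetical.

```python
import math
import random


def predict(w, row):
    """Linear prediction for one sample."""
    return sum(wi * xi for wi, xi in zip(w, row))


def mse_loss(w, X, y):
    """0.5 * mean squared error of the linear model on (X, y)."""
    return 0.5 * sum((predict(w, r) - t) ** 2 for r, t in zip(X, y)) / len(y)


def mse_grad(w, X, y):
    """Gradient of mse_loss with respect to w."""
    grad = [0.0] * len(w)
    for row, target in zip(X, y):
        residual = predict(w, row) - target
        for j, xj in enumerate(row):
            grad[j] += residual * xj / len(y)
    return grad


def cosine(a, b, eps=1e-12):
    """Cosine similarity between two gradient vectors (0 if either is ~zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b + eps)


def dual_loop_step(w, clean, degraded, inner_lr=0.1, outer_lr=0.1):
    """One hypothetical dual-loop update (first-order approximation).

    Inner loop: take a tentative step on the high-quality batch.
    Outer loop: evaluate the low-quality batch at the adapted weights,
    then update the original weights with both gradients.
    Returns the new weights and the cosine similarity of the two gradients,
    which serves here as a gradient-consistency diagnostic.
    """
    g_clean = mse_grad(w, *clean)
    w_inner = [wi - inner_lr * gi for wi, gi in zip(w, g_clean)]
    g_degraded = mse_grad(w_inner, *degraded)
    consistency = cosine(g_clean, g_degraded)
    w_new = [wi - outer_lr * (gc + gd)
             for wi, gc, gd in zip(w, g_clean, g_degraded)]
    return w_new, consistency


def make_toy_data(n=64, dim=4, noise=0.3, seed=0):
    """Synthetic 'clean' features and a noisy 'degraded' copy (assumed setup)."""
    rng = random.Random(seed)
    w_true = [rng.uniform(-1, 1) for _ in range(dim)]
    X = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(n)]
    y = [predict(w_true, row) for row in X]
    X_degraded = [[x + rng.gauss(0, noise) for x in row] for row in X]
    return (X, y), (X_degraded, y)
```

In this sketch the consistency value is only reported. A training objective in the spirit of the abstract would presumably also encourage it to be high across quality regimes, but how the paper does so is not stated here.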
