CVAIFeb 3, 2025

CLIP-DQA: Blindly Evaluating Dehazed Images from Global and Local Perspectives Using CLIP

arXiv:2502.01707v17 citationsh-index: 14Has CodeISCAS
Originality Synthesis-oriented
AI Analysis

This work addresses the evaluation of dehazing algorithms for image processing applications, but it is incremental as it applies a pre-trained model to a specific domain with limited dataset improvements.

The paper tackles the problem of blind dehazed image quality assessment (BDQA) by adapting CLIP to evaluate dehazed images from global and local perspectives, achieving more accurate quality predictions over existing methods on two authentic datasets.

Blind dehazed image quality assessment (BDQA), which aims to accurately predict the visual quality of dehazed images without any reference information, is essential for the evaluation, comparison, and optimization of image dehazing algorithms. Existing learning-based BDQA methods have achieved remarkable success, while the small scale of DQA datasets limits their performance. To address this issue, in this paper, we propose to adapt Contrastive Language-Image Pre-Training (CLIP), pre-trained on large-scale image-text pairs, to the BDQA task. Specifically, inspired by the fact that the human visual system understands images based on hierarchical features, we take global and local information of the dehazed image as the input of CLIP. To accurately map the input hierarchical information of dehazed images into the quality score, we tune both the vision branch and language branch of CLIP with prompt learning. Experimental results on two authentic DQA datasets demonstrate that our proposed approach, named CLIP-DQA, achieves more accurate quality predictions over existing BDQA methods. The code is available at https://github.com/JunFu1995/CLIP-DQA.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes