CVMMIVAug 19, 2024

Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement

arXiv:2408.09920v12 citationsh-index: 16Has Code
Originality Incremental advance
AI Analysis

This work addresses limitations in capturing human perception for image quality assessment, offering an incremental enhancement to existing models.

The authors tackled the problem of enhancing full-reference image quality assessment (FR-IQA) models by proposing a training-free human visual attention estimation strategy using sliced maximal information coefficient, which consistently improved existing IQA models in experiments.

Full-reference image quality assessment (FR-IQA) models generally operate by measuring the visual differences between a degraded image and its reference. However, existing FR-IQA models including both the classical ones (eg, PSNR and SSIM) and deep-learning based measures (eg, LPIPS and DISTS) still exhibit limitations in capturing the full perception characteristics of the human visual system (HVS). In this paper, instead of designing a new FR-IQA measure, we aim to explore a generalized human visual attention estimation strategy to mimic the process of human quality rating and enhance existing IQA models. In particular, we model human attention generation by measuring the statistical dependency between the degraded image and the reference image. The dependency is captured in a training-free manner by our proposed sliced maximal information coefficient and exhibits surprising generalization in different IQA measures. Experimental results verify the performance of existing IQA models can be consistently improved when our attention module is incorporated. The source code is available at https://github.com/KANGX99/SMIC.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes