CV LG IVSep 21, 2022

Learning from Mixed Datasets: A Monotonic Image Quality Assessment Model

Zhaopeng Feng, Keyang Zhang, Shuyue Jia, Baoliang Chen, Shiqi Wang

arXiv:2209.10451v34.811 citationsh-index: 19Has Code

Originality Incremental advance

AI Analysis

This addresses the challenge of combining different IQA datasets with varying annotation criteria for researchers in computer vision, though it is incremental as it builds on existing mixed dataset training approaches.

The paper tackles the problem of deep learning models overfitting to specific scenes in image quality assessment (IQA) by proposing a monotonic neural network that learns from mixed datasets without aligning annotations, achieving improved generalization as verified by experimental results.

Deep learning based image quality assessment (IQA) models usually learn to predict image quality from a single dataset, leading the model to overfit specific scenes. To account for this, mixed datasets training can be an effective way to enhance the generalization capability of the model. However, it is nontrivial to combine different IQA datasets, as their quality evaluation criteria, score ranges, view conditions, as well as subjects are usually not shared during the image quality annotation. In this paper, instead of aligning the annotations, we propose a monotonic neural network for IQA model learning with different datasets combined. In particular, our model consists of a dataset-shared quality regressor and several dataset-specific quality transformers. The quality regressor aims to obtain the perceptual qualities of each dataset while each quality transformer maps the perceptual qualities to the corresponding dataset annotations with their monotonicity maintained. The experimental results verify the effectiveness of the proposed learning strategy and our code is available at https://github.com/fzp0424/MonotonicIQA.

View on arXiv PDF Code

Similar