CVLGIVSep 21, 2022

Learning from Mixed Datasets: A Monotonic Image Quality Assessment Model

arXiv:2209.10451v311 citationsh-index: 19Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of combining different IQA datasets with varying annotation criteria for researchers in computer vision, though it is incremental as it builds on existing mixed dataset training approaches.

The paper tackles the problem of deep learning models overfitting to specific scenes in image quality assessment (IQA) by proposing a monotonic neural network that learns from mixed datasets without aligning annotations, achieving improved generalization as verified by experimental results.

Deep learning based image quality assessment (IQA) models usually learn to predict image quality from a single dataset, leading the model to overfit specific scenes. To account for this, mixed datasets training can be an effective way to enhance the generalization capability of the model. However, it is nontrivial to combine different IQA datasets, as their quality evaluation criteria, score ranges, view conditions, as well as subjects are usually not shared during the image quality annotation. In this paper, instead of aligning the annotations, we propose a monotonic neural network for IQA model learning with different datasets combined. In particular, our model consists of a dataset-shared quality regressor and several dataset-specific quality transformers. The quality regressor aims to obtain the perceptual qualities of each dataset while each quality transformer maps the perceptual qualities to the corresponding dataset annotations with their monotonicity maintained. The experimental results verify the effectiveness of the proposed learning strategy and our code is available at https://github.com/fzp0424/MonotonicIQA.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes