CVNCQMDec 13, 2024

RAID-Database: human Responses to Affine Image Distortions

arXiv:2412.10211v21 citationsh-index: 27
Originality Synthesis-oriented
AI Analysis

This provides a domain-specific resource for training models to predict human perception of affine image distortions, which are incremental as it extends existing databases with new data.

The paper tackles the lack of image quality databases for affine distortions common in natural conditions by presenting a dataset of human responses to suprathreshold affine transforms and Gaussian noise, involving 105 observers and over 20,000 comparisons on 864 distorted images.

Image quality databases are used to train models for predicting subjective human perception. However, most existing databases focus on distortions commonly found in digital media and not in natural conditions. Affine transformations are particularly relevant to study, as they are among the most commonly encountered by human observers in everyday life. This Data Descriptor presents a set of human responses to suprathreshold affine image transforms (rotation, translation, scaling) and Gaussian noise as convenient reference to compare with previously existing image quality databases. The responses were measured using well established psychophysics: the Maximum Likelihood Difference Scaling method. The set contains responses to 864 distorted images. The experiments involved 105 observers and more than 20000 comparisons of quadruples of images. The quality of the dataset is ensured because (a) it reproduces the classical Piéron's law, (b) it reproduces classical absolute detection thresholds, and (c) it is consistent with conventional image quality databases but improves them according to Group-MAD experiments.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes