Appeal prediction for AI up-scaled Images
This work addresses the need for better evaluation metrics in image up-scaling for researchers and practitioners, though it is incremental as it builds on existing methods and datasets.
The paper tackles the lack of comprehensive evaluation for AI-based image up-scaling methods by creating a dataset of 1,496 annotated images from 136 base images and five up-scaling methods, and finds that Real-ESRGAN and BSRGAN perform best in appeal. It also trains a DNN to detect up-scaling methods with good performance and develops two models for appeal prediction, with a transfer learning approach achieving the best results.
DNN- or AI-based up-scaling algorithms are gaining in popularity due to the improvements in machine learning. Various up-scaling models using CNNs, GANs or mixed approaches have been published. The majority of models are evaluated using PSRN and SSIM or only a few example images. However, a performance evaluation with a wide range of real-world images and subjective evaluation is missing, which we tackle in the following paper. For this reason, we describe our developed dataset, which uses 136 base images and five different up-scaling methods, namely Real-ESRGAN, BSRGAN, waifu2x, KXNet, and Lanczos. Overall the dataset consists of 1496 annotated images. The labeling of our dataset focused on image appeal and has been performed using crowd-sourcing employing our open-source tool AVRate Voyager. We evaluate the appeal of the different methods, and the results indicate that Real-ESRGAN and BSRGAN are the best. Furthermore, we train a DNN to detect which up-scaling method has been used, the trained models have a good overall performance in our evaluation. In addition to this, we evaluate state-of-the-art image appeal and quality models, here none of the models showed a high prediction performance, therefore we also trained two own approaches. The first uses transfer learning and has the best performance, and the second model uses signal-based features and a random forest model with good overall performance. We share the data and implementation to allow further research in the context of open science.