MMCVLGMay 9, 2020

Comment on "No-Reference Video Quality Assessment Based on the Temporal Pooling of Deep Features"

arXiv:2005.04400v11 citations
Originality Synthesis-oriented
AI Analysis

This comment exposes flawed results in a prior study, highlighting reproducibility issues in video quality assessment research.

The authors reimplemented a prior method for blind video quality assessment and found that its reported superior performance was due to data leakage, resulting in results below the state-of-the-art.

In Neural Processing Letters 50,3 (2019) a machine learning approach to blind video quality assessment was proposed. It is based on temporal pooling of features of video frames, taken from the last pooling layer of deep convolutional neural networks. The method was validated on two established benchmark datasets and gave results far better than the previous state-of-the-art. In this letter we report the results from our careful reimplementations. The performance results, claimed in the paper, cannot be reached, and are even below the state-of-the-art by a large margin. We show that the originally reported wrong performance results are a consequence of two cases of data leakage. Information from outside the training dataset was used in the fine-tuning stage and in the model evaluation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes