IVCVMay 14, 2024

RMT-BVQA: Recurrent Memory Transformer-based Blind Video Quality Assessment for Enhanced Video Content

arXiv:2405.08621v52 citationsh-index: 13ECCV Workshops
Originality Incremental advance
AI Analysis

This addresses the need for better quality assessment tools for enhanced video content, which is incremental as it adapts existing deep learning techniques to a specific domain.

The paper tackles the problem of assessing video quality for enhanced content, which lacks dedicated metrics, by proposing RMT-BVQA, a blind deep video quality assessment method that achieves superior correlation performance compared to ten existing no-reference metrics on the VDPVE database.

With recent advances in deep learning, numerous algorithms have been developed to enhance video quality, reduce visual artifacts, and improve perceptual quality. However, little research has been reported on the quality assessment of enhanced content - the evaluation of enhancement methods is often based on quality metrics that were designed for compression applications. In this paper, we propose a novel blind deep video quality assessment (VQA) method specifically for enhanced video content. It employs a new Recurrent Memory Transformer (RMT) based network architecture to obtain video quality representations, which is optimized through a novel content-quality-aware contrastive learning strategy based on a new database containing 13K training patches with enhanced content. The extracted quality representations are then combined through linear regression to generate video-level quality indices. The proposed method, RMT-BVQA, has been evaluated on the VDPVE (VQA Dataset for Perceptual Video Enhancement) database through a five-fold cross validation. The results show its superior correlation performance when compared to ten existing no-reference quality metrics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes