IVCVDec 14, 2023

RankDVQA-mini: Knowledge Distillation-Driven Deep Video Quality Assessment

arXiv:2312.08864v22 citationsh-index: 13PCS
Originality Incremental advance
AI Analysis

This addresses deployment limitations for video quality assessment in practical applications, though it is incremental as it builds on an existing method.

The paper tackles the high computational complexity and large memory requirements of deep video quality assessment models by compressing the state-of-the-art RankDVQA method through pruning and knowledge distillation, resulting in RankDVQA-mini which uses less than 10% of parameters while maintaining superior prediction performance.

Deep learning-based video quality assessment (deep VQA) has demonstrated significant potential in surpassing conventional metrics, with promising improvements in terms of correlation with human perception. However, the practical deployment of such deep VQA models is often limited due to their high computational complexity and large memory requirements. To address this issue, we aim to significantly reduce the model size and runtime of one of the state-of-the-art deep VQA methods, RankDVQA, by employing a two-phase workflow that integrates pruning-driven model compression with multi-level knowledge distillation. The resulting lightweight full reference quality metric, RankDVQA-mini, requires less than 10% of the model parameters compared to its full version (14% in terms of FLOPs), while still retaining a quality prediction performance that is superior to most existing deep VQA methods. The source code of the RankDVQA-mini has been released at https://chenfeng-bristol.github.io/RankDVQA-mini/ for public evaluation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes