IV CV LG MMOct 9, 2020

Video Quality Enhancement Using Deep Learning-Based Prediction Models for Quantized DCT Coefficients in MPEG I-frames

Antonio J G Busson, Paulo R C Mendes, Daniel de S Moraes, Álvaro M da Veiga, Álan L V Guedes, Sérgio Colcher

arXiv:2010.05760v13.73 citations

Originality Incremental advance

AI Analysis

This addresses video compression artifacts for users of MPEG-encoded media, but it is incremental as it builds on existing frequency-domain approaches.

The paper tackles video quality enhancement by predicting missing quantized DCT coefficients in MPEG I-frames using a deep learning model, improving frames from Quality Factor 10 to near 20.

Recent works have successfully applied some types of Convolutional Neural Networks (CNNs) to reduce the noticeable distortion resulting from the lossy JPEG/MPEG compression technique. Most of them are built upon the processing made on the spatial domain. In this work, we propose a MPEG video decoder that is purely based on the frequency-to-frequency domain: it reads the quantized DCT coefficients received from a low-quality I-frames bitstream and, using a deep learning-based model, predicts the missing coefficients in order to recompose the same frames with enhanced quality. In experiments with a video dataset, our best model was able to improve from frames with quantized DCT coefficients corresponding to a Quality Factor (QF) of 10 to enhanced quality frames with QF slightly near to 20.

View on arXiv PDF

Similar