CVMar 27, 2021

Video Rescaling Networks with Joint Optimization Strategies for Downscaling and Upscaling

arXiv:2103.14858v120 citations
Originality Incremental advance
AI Analysis

It addresses the problem of adapting video resolution for different devices, offering improved upscaling quality, though it is incremental in building on existing invertible neural networks.

This paper tackles the video rescaling task by jointly optimizing downscaling and upscaling, introducing two neural network approaches that outperform image-based and non-joint video methods in quality metrics.

This paper addresses the video rescaling task, which arises from the needs of adapting the video spatial resolution to suit individual viewing devices. We aim to jointly optimize video downscaling and upscaling as a combined task. Most recent studies focus on image-based solutions, which do not consider temporal information. We present two joint optimization approaches based on invertible neural networks with coupling layers. Our Long Short-Term Memory Video Rescaling Network (LSTM-VRN) leverages temporal information in the low-resolution video to form an explicit prediction of the missing high-frequency information for upscaling. Our Multi-input Multi-output Video Rescaling Network (MIMO-VRN) proposes a new strategy for downscaling and upscaling a group of video frames simultaneously. Not only do they outperform the image-based invertible model in terms of quantitative and qualitative results, but also show much improved upscaling quality than the video rescaling methods without joint optimization. To our best knowledge, this work is the first attempt at the joint optimization of video downscaling and upscaling.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes