CVAILGIVApr 6, 2021

Efficient Video Compression via Content-Adaptive Super-Resolution

arXiv:2104.02322v167 citations
Originality Incremental advance
AI Analysis

This addresses the problem of inefficient deep learning-based video compression for Internet video delivery, offering a significant improvement over existing methods.

The paper tackles video compression by augmenting existing codecs with a content-adaptive super-resolution model, achieving the same PSNR with 16% of the bits-per-pixel of H.265 and 2% of DVC.

Video compression is a critical component of Internet video delivery. Recent work has shown that deep learning techniques can rival or outperform human-designed algorithms, but these methods are significantly less compute and power-efficient than existing codecs. This paper presents a new approach that augments existing codecs with a small, content-adaptive super-resolution model that significantly boosts video quality. Our method, SRVC, encodes video into two bitstreams: (i) a content stream, produced by compressing downsampled low-resolution video with the existing codec, (ii) a model stream, which encodes periodic updates to a lightweight super-resolution neural network customized for short segments of the video. SRVC decodes the video by passing the decompressed low-resolution video frames through the (time-varying) super-resolution model to reconstruct high-resolution video frames. Our results show that to achieve the same PSNR, SRVC requires 16% of the bits-per-pixel of H.265 in slow mode, and 2% of the bits-per-pixel of DVC, a recent deep learning-based video compression scheme. SRVC runs at 90 frames per second on a NVIDIA V100 GPU.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes