CVSep 1, 2020

Object Detection-Based Variable Quantization Processing

arXiv:2009.00189v1
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement for image and video compression systems, aiming to enhance quality without increasing file sizes.

The paper tackles the problem of making traditional image and video encoders content-aware by proposing a preprocessing method that uses object detection to guide adaptive quantization, resulting in higher quality parameters without increasing output size, with experiments showing improved MS-SSIM and enhanced watching experience under the same or similar bitrates.

In this paper, we propose a preprocessing method for conventional image and video encoders that can make these existing encoders content-aware. By going through our process, a higher quality parameter could be set on a traditional encoder without increasing the output size. A still frame or an image will firstly go through an object detector. Either the properties of the detection result will decide the parameters of the following procedures, or the system will be bypassed if no object is detected in the given frame. The processing method utilizes an adaptive quantization process to determine the portion of data to be dropped. This method is primarily based on the JPEG compression theory and is optimum for JPEG-based encoders such as JPEG encoders and the Motion JPEG encoders. However, other DCT-based encoders like MPEG part 2, H.264, etc. can also benefit from this method. In the experiments, we compare the MS-SSIM under the same bitrate as well as similar MS-SSIM but enhanced bitrate. As this method is based on human perception, even with similar MS-SSIM, the overall watching experience will be better than the direct encoded ones.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes