CVMar 29, 2017

Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks

Nick Johnston, Damien Vincent, David Minnen, Michele Covell, Saurabh Singh, Troy Chinen, Sung Jin Hwang, Joel Shor, George Toderici

arXiv:1703.10114v127.7399 citationsh-index: 30

Originality Incremental advance

AI Analysis

This work improves image compression quality for applications like storage and transmission, though it is incremental over existing deep learning approaches.

The authors tackled lossy image compression by proposing a recurrent convolutional neural network method that outperforms BPG, WebP, JPEG2000, and JPEG in MS-SSIM metrics, achieving state-of-the-art results on Kodak and Tecnick image sets.

We propose a method for lossy image compression based on recurrent, convolutional neural networks that outperforms BPG (4:2:0 ), WebP, JPEG2000, and JPEG as measured by MS-SSIM. We introduce three improvements over previous research that lead to this state-of-the-art result. First, we show that training with a pixel-wise loss weighted by SSIM increases reconstruction quality according to several metrics. Second, we modify the recurrent architecture to improve spatial diffusion, which allows the network to more effectively capture and propagate image information through the network's hidden state. Finally, in addition to lossless entropy coding, we use a spatially adaptive bit allocation algorithm to more efficiently use the limited number of bits to encode visually complex image regions. We evaluate our method on the Kodak and Tecnick image sets and compare against standard codecs as well recently published methods based on deep neural networks.

View on arXiv PDF

Similar