IVITLGJun 22, 2021

Neural Distributed Image Compression using Common Information

arXiv:2106.11723v230 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of efficient image compression in scenarios like stereo vision, where side information is available only at the receiver, offering a practical solution for applications such as autonomous driving or 3D imaging.

The paper tackles the problem of compressing an image when a correlated image is available only at the decoder, using a deep neural network architecture for distributed source coding, and demonstrates improved performance on stereo image datasets like KITTI and Cityscape.

We present a novel deep neural network (DNN) architecture for compressing an image when a correlated image is available as side information only at the decoder. This problem is known as distributed source coding (DSC) in information theory. In particular, we consider a pair of stereo images, which generally have high correlation with each other due to overlapping fields of view, and assume that one image of the pair is to be compressed and transmitted, while the other image is available only at the decoder. In the proposed architecture, the encoder maps the input image to a latent space, quantizes the latent representation, and compresses it using entropy coding. The decoder is trained to extract the common information between the input image and the correlated image, using only the latter. The received latent representation and the locally generated common information are passed through a decoder network to obtain an enhanced reconstruction of the input image. The common information provides a succinct representation of the relevant information at the receiver. We train and demonstrate the effectiveness of the proposed approach on the KITTI and Cityscape datasets of stereo image pairs. Our results show that the proposed architecture is capable of exploiting the decoder-only side information, and outperforms previous work on stereo image compression with decoder side information.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes