LGAIITCOMLApr 21, 2021

Lossless Compression with Latent Variable Models

arXiv:2104.10544v28 citations
Originality Incremental advance
AI Analysis

This provides a novel approach to lossless compression for data storage and transmission, though it builds incrementally on existing latent variable model techniques.

The paper tackles lossless compression by developing BB-ANS, a method using latent variable models that achieves optimal compression rates in batches, demonstrating state-of-the-art results on MNIST and extending to full-size ImageNet images with hierarchical models.

We develop a simple and elegant method for lossless compression using latent variable models, which we call 'bits back with asymmetric numeral systems' (BB-ANS). The method involves interleaving encode and decode steps, and achieves an optimal rate when compressing batches of data. We demonstrate it firstly on the MNIST test set, showing that state-of-the-art lossless compression is possible using a small variational autoencoder (VAE) model. We then make use of a novel empirical insight, that fully convolutional generative models, trained on small images, are able to generalize to images of arbitrary size, and extend BB-ANS to hierarchical latent variable models, enabling state-of-the-art lossless compression of full-size colour images from the ImageNet dataset. We describe 'Craystack', a modular software framework which we have developed for rapid prototyping of compression using deep generative models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes