LGCVITJun 17, 2022

Fast Lossless Neural Compression with Integer-Only Discrete Flows

arXiv:2206.08869v18 citationsh-index: 34
Originality Incremental advance
AI Analysis

This work addresses the deployment bottleneck of neural compressors in practical applications, representing an incremental improvement in efficiency.

The paper tackled the high inference latency of neural compressors by proposing Integer-only Discrete Flows (IODF), an efficient neural compressor using integer-only arithmetic, achieving a 10x inference speedup compared to existing methods while maintaining high compression rates on ImageNet32 and ImageNet64.

By applying entropy codecs with learned data distributions, neural compressors have significantly outperformed traditional codecs in terms of compression ratio. However, the high inference latency of neural networks hinders the deployment of neural compressors in practical applications. In this work, we propose Integer-only Discrete Flows (IODF), an efficient neural compressor with integer-only arithmetic. Our work is built upon integer discrete flows, which consists of invertible transformations between discrete random variables. We propose efficient invertible transformations with integer-only arithmetic based on 8-bit quantization. Our invertible transformation is equipped with learnable binary gates to remove redundant filters during inference. We deploy IODF with TensorRT on GPUs, achieving 10x inference speedup compared to the fastest existing neural compressors, while retaining the high compression rates on ImageNet32 and ImageNet64.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes