IVCVSep 12, 2022

High-Fidelity Variable-Rate Image Compression via Invertible Activation Transformation

arXiv:2209.05054v117 citationsh-index: 34
Originality Incremental advance
AI Analysis

This addresses the issue of image quality degradation in variable-rate compression for applications requiring multiple processing steps, though it is incremental as it builds on existing VAE and INN-based methods.

The paper tackles the problem of variable-rate image compression by proposing an Invertible Activation Transformation (IAT) module to maintain high fidelity and avoid artifacts during multiple compression/decompression cycles, achieving comparable rate-distortion performance and outperforming state-of-the-art methods by a large margin after re-encodings.

Learning-based methods have effectively promoted the community of image compression. Meanwhile, variational autoencoder (VAE) based variable-rate approaches have recently gained much attention to avoid the usage of a set of different networks for various compression rates. Despite the remarkable performance that has been achieved, these approaches would be readily corrupted once multiple compression/decompression operations are executed, resulting in the fact that image quality would be tremendously dropped and strong artifacts would appear. Thus, we try to tackle the issue of high-fidelity fine variable-rate image compression and propose the Invertible Activation Transformation (IAT) module. We implement the IAT in a mathematical invertible manner on a single rate Invertible Neural Network (INN) based model and the quality level (QLevel) would be fed into the IAT to generate scaling and bias tensors. IAT and QLevel together give the image compression model the ability of fine variable-rate control while better maintaining the image fidelity. Extensive experiments demonstrate that the single rate image compression model equipped with our IAT module has the ability to achieve variable-rate control without any compromise. And our IAT-embedded model obtains comparable rate-distortion performance with recent learning-based image compression methods. Furthermore, our method outperforms the state-of-the-art variable-rate image compression method by a large margin, especially after multiple re-encodings.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes