IVCVJun 18, 2024

Learned Compression of Encoding Distributions

arXiv:2406.13059v11 citations
Originality Highly original
AI Analysis

This work addresses the amortization gap in learned compression for image data, offering improved compression efficiency with lower computational cost compared to existing methods.

The paper tackled the problem of suboptimal compression due to the static encoding distribution in learned compression models, which leads to an amortization gap, by proposing a method that dynamically adapts the encoding distribution to match the latent data distribution for each input, achieving a BD-rate gain of -7.10% on the Kodak test dataset.

The entropy bottleneck introduced by Ballé et al. is a common component used in many learned compression models. It encodes a transformed latent representation using a static distribution whose parameters are learned during training. However, the actual distribution of the latent data may vary wildly across different inputs. The static distribution attempts to encompass all possible input distributions, thus fitting none of them particularly well. This unfortunate phenomenon, sometimes known as the amortization gap, results in suboptimal compression. To address this issue, we propose a method that dynamically adapts the encoding distribution to match the latent data distribution for a specific input. First, our model estimates a better encoding distribution for a given input. This distribution is then compressed and transmitted as an additional side-information bitstream. Finally, the decoder reconstructs the encoding distribution and uses it to decompress the corresponding latent data. Our method achieves a Bjøntegaard-Delta (BD)-rate gain of -7.10% on the Kodak test dataset when applied to the standard fully-factorized architecture. Furthermore, considering computational complexity, the transform used by our method is an order of magnitude cheaper in terms of Multiply-Accumulate (MAC) operations compared to related side-information methods such as the scale hyperprior.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes