CVSep 28, 2022

Multi-Sample Training for Neural Image Compression

arXiv:2209.13834v16 citationsh-index: 25
Originality Incremental advance
AI Analysis

This work addresses compression efficiency for image processing applications, offering an incremental enhancement to existing neural compression techniques.

The paper tackles the problem of lossy neural image compression by proposing a multiple-sample training approach using an importance weighted autoencoder target, which improves state-of-the-art methods with plug-and-play applicability.

This paper considers the problem of lossy neural image compression (NIC). Current state-of-the-art (sota) methods adopt uniform posterior to approximate quantization noise, and single-sample pathwise estimator to approximate the gradient of evidence lower bound (ELBO). In this paper, we propose to train NIC with multiple-sample importance weighted autoencoder (IWAE) target, which is tighter than ELBO and converges to log likelihood as sample size increases. First, we identify that the uniform posterior of NIC has special properties, which affect the variance and bias of pathwise and score function estimators of the IWAE target. Moreover, we provide insights on a commonly adopted trick in NIC from gradient variance perspective. Based on those analysis, we further propose multiple-sample NIC (MS-NIC), an enhanced IWAE target for NIC. Experimental results demonstrate that it improves sota NIC methods. Our MS-NIC is plug-and-play, and can be easily extended to other neural compression tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes