IV · CV · Aug 29, 2024

Learned Image Transmission with Hierarchical Variational Autoencoder

arXiv:2408.16340v5 · 8 citations · h-index: 16
Originality: Incremental advance
AI Analysis

This paper addresses efficient and robust image transmission for communication systems. It is an incremental improvement over prior work, with some novel architectural elements.

The paper tackles image transmission over noisy channels by introducing a hierarchical joint source-channel coding framework using a hierarchical VAE, which dynamically adjusts transmission bandwidth and outperforms existing baselines in rate-distortion performance.

In this paper, we introduce an innovative hierarchical joint source-channel coding (HJSCC) framework for image transmission, utilizing a hierarchical variational autoencoder (VAE). Our approach leverages a combination of bottom-up and top-down paths at the transmitter to autoregressively generate multiple hierarchical representations of the original image. These representations are then directly mapped to channel symbols for transmission by the JSCC encoder. We extend this framework to scenarios with a feedback link, modeling transmission over a noisy channel as a probabilistic sampling process and deriving a novel generative formulation for JSCC with feedback. Compared with existing approaches, our proposed HJSCC provides enhanced adaptability by dynamically adjusting transmission bandwidth, encoding these representations into varying amounts of channel symbols. Extensive experiments on images of varying resolutions demonstrate that our proposed model outperforms existing baselines in rate-distortion performance and maintains robustness against channel noise. The source code will be made available upon acceptance.
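The variable-bandwidth idea in the abstract, where each hierarchical representation is sent with its own number of channel symbols, can be illustrated with a toy sketch. This is not the authors' method: the abstract only mentions a noisy channel, so the AWGN model, the unit-power normalization, the real-valued symbols, and the pad-or-truncate stand-in for the learned JSCC encoder are all assumptions made here, and the names `power_normalize`, `awgn_channel`, and `transmit_hierarchy` are hypothetical.

```python
import math
import random


def power_normalize(symbols):
    """Scale real-valued symbols to unit average power."""
    power = sum(s * s for s in symbols) / len(symbols)
    scale = 1.0 / math.sqrt(power) if power > 0 else 1.0
    return [s * scale for s in symbols]


def awgn_channel(symbols, snr_db, rng):
    """Add white Gaussian noise for a given SNR, assuming unit signal power."""
    noise_std = math.sqrt(10 ** (-snr_db / 10))
    return [s + rng.gauss(0.0, noise_std) for s in symbols]


def transmit_hierarchy(latents, symbols_per_level, snr_db, seed=0):
    """Send each level's latent using its own symbol budget.

    Hypothetical stand-in for a learned JSCC encoder: each level is simply
    zero-padded or truncated to its budget before power normalization.
    """
    rng = random.Random(seed)
    received = []
    for z, k in zip(latents, symbols_per_level):
        x = (list(z) + [0.0] * k)[:k]  # toy "encoder": pad/truncate to k symbols
        received.append(awgn_channel(power_normalize(x), snr_db, rng))
    return received


# Two hierarchy levels with different (assumed) symbol budgets.
latents = [[0.5, -1.0, 2.0, 0.1], [1.5, -0.5, 0.3, 0.9, -1.2, 0.4]]
budgets = [2, 6]
out = transmit_hierarchy(latents, budgets, snr_db=10.0)
total_symbols = sum(len(y) for y in out)  # overall bandwidth used
```

In the paper the symbol budgets are chosen dynamically by the model; here they are fixed by hand only to show how per-level budgets add up to the total transmission bandwidth.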
