LG AI IVJul 31, 2024

Semantic Successive Refinement: A Generative AI-aided Semantic Communication Framework

Kexin Zhang, Lixin Li, Wensheng Lin, Yuna Yan, Rui Li, Wenchi Cheng, Zhu Han

arXiv:2408.05112v112.530 citationsh-index: 13

Originality Highly original

AI Analysis

This addresses the issue of enhanced image quality in communication systems for applications like multimedia transmission, though it appears incremental as it builds on existing semantic communication with new generative models.

The paper tackles the problem of low perceptual quality in semantic communication under low SNR by introducing a generative AI-aided framework, achieving improvements of 17.75% in PSNR for AWGN channels and 20.86% for Rayleigh channels compared to CNN-based DeepJSCC.

Semantic Communication (SC) is an emerging technology aiming to surpass the Shannon limit. Traditional SC strategies often minimize signal distortion between the original and reconstructed data, neglecting perceptual quality, especially in low Signal-to-Noise Ratio (SNR) environments. To address this issue, we introduce a novel Generative AI Semantic Communication (GSC) system for single-user scenarios. This system leverages deep generative models to establish a new paradigm in SC. Specifically, At the transmitter end, it employs a joint source-channel coding mechanism based on the Swin Transformer for efficient semantic feature extraction and compression. At the receiver end, an advanced Diffusion Model (DM) reconstructs high-quality images from degraded signals, enhancing perceptual details. Additionally, we present a Multi-User Generative Semantic Communication (MU-GSC) system utilizing an asynchronous processing model. This model effectively manages multiple user requests and optimally utilizes system resources for parallel processing. Simulation results on public datasets demonstrate that our generative AI semantic communication systems achieve superior transmission efficiency and enhanced communication content quality across various channel conditions. Compared to CNN-based DeepJSCC, our methods improve the Peak Signal-to-Noise Ratio (PSNR) by 17.75% in Additive White Gaussian Noise (AWGN) channels and by 20.86% in Rayleigh channels.

View on arXiv PDF

Similar