CV | Oct 26, 2024

Semantic Feature Decomposition based Semantic Communication System of Images with Large-scale Visual Generation Models

arXiv:2410.20126v1 | 4 citations | h-index: 11
Originality: Highly original
AI Analysis

This work addresses the need for more efficient and robust image communication in scenarios with high data volume and environmental complexity. It presents a new paradigm rather than an incremental improvement.

The authors aim to improve image communication systems by proposing a paradigm that integrates semantic communication with large-scale visual generation models, achieving high compression, noise resistance, and visual similarity in image transmission.

End-to-end image communication systems have been widely studied in the academic community. The escalating demands on image communication systems in terms of data volume, environmental complexity, and task precision require enhanced communication efficiency, anti-noise ability, and semantic fidelity. We therefore propose a novel paradigm based on Semantic Feature Decomposition (SeFD) that integrates semantic communication with large-scale visual generation models to achieve high-performance, highly interpretable, and controllable image communication. Following this paradigm, we propose a Texture-Color based Semantic Communication system of Images (TCSCI). At the transmitter, TCSCI decomposes an image into its natural language description (text), texture, and color semantic features. These features are transmitted over the wireless channel, and at the receiver a large-scale visual generation model restores the image from the received features. TCSCI can achieve extremely compressed, highly noise-resistant, and visually similar image semantic communication while ensuring the interpretability and editability of the transmission process. Experiments demonstrate that TCSCI outperforms traditional image communication systems and existing semantic communication systems under extreme compression, with good anti-noise performance and interpretability.
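The transmitter side of the pipeline described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. The abstract does not say how the features are extracted or transmitted, so everything here is an assumed stand-in chosen for simplicity: a grid of mean colors for the color feature, a gradient-threshold edge map for the texture feature, a fixed caption string in place of a captioning model, and BPSK over an AWGN channel. All function names are hypothetical. The receiver's large-scale generation model is omitted entirely.

```python
import numpy as np

def color_feature(img, grid=4):
    """Assumed color feature: mean RGB over a grid x grid tiling of the image."""
    h, w, _ = img.shape
    img = img[: h - h % grid, : w - w % grid]  # crop so the tiles divide evenly
    bh, bw = img.shape[0] // grid, img.shape[1] // grid
    return img.reshape(grid, bh, grid, bw, 3).mean(axis=(1, 3))

def texture_feature(img, thresh=0.1):
    """Assumed texture feature: binary edge map from gradient magnitude."""
    gray = img.mean(axis=2)
    gy, gx = np.gradient(gray)
    return (np.hypot(gx, gy) > thresh).astype(np.uint8)

def to_bits(values):
    """Flatten a uint8 array into a bit vector."""
    return np.unpackbits(np.asarray(values, dtype=np.uint8).ravel())

def awgn_bpsk(bits, snr_db, rng):
    """Send bits as BPSK symbols over AWGN; snr_db is read as Es/N0 (assumption)."""
    symbols = 1.0 - 2.0 * bits  # bit 0 -> +1, bit 1 -> -1
    noise_std = np.sqrt(1.0 / (2.0 * 10 ** (snr_db / 10)))
    received = symbols + noise_std * rng.standard_normal(symbols.shape)
    return (received < 0).astype(np.uint8)  # hard decision back to bits

rng = np.random.default_rng(0)
img = rng.random((32, 32, 3))      # random stand-in for a real image in [0, 1]
caption = "a placeholder caption"  # a captioning model would produce this

# Decompose into the three semantic features named in the abstract.
text_bits = to_bits(np.frombuffer(caption.encode("utf-8"), dtype=np.uint8))
color = np.round(color_feature(img) * 255).astype(np.uint8)
edges = texture_feature(img)

# Concatenate the features into one payload and pass it through the channel.
payload = np.concatenate([text_bits, to_bits(color), edges.ravel()])
received = awgn_bpsk(payload, snr_db=20, rng=rng)

raw_bits = img.shape[0] * img.shape[1] * 3 * 8
print(f"payload bits: {payload.size}, raw image bits: {raw_bits}")
print(f"bit errors: {int(np.sum(received != payload))}")
```

In this toy setup the payload is much smaller than the raw 8-bit image because only a caption, a coarse color layout, and a one-bit-per-pixel edge map are sent. How well an image can be restored from such features depends on the receiver's generation model, which this sketch does not attempt to show.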
