CVIVSPMar 10, 2025

Semantic Communications with Computer Vision Sensing for Edge Video Transmission

arXiv:2503.07252v17 citationsh-index: 115IEEE Trans Mob Comput
Originality Incremental advance
AI Analysis

This work addresses spectrum resource consumption for edge applications like surveillance, though it appears incremental as it builds on existing semantic communication methods by adding sensing capabilities.

The paper tackles the problem of spectrum inefficiency in edge video transmission by proposing a semantic communication framework with computer vision sensing, which adaptively compresses static and dynamic frames and uses real-time sensing to assess frame significance, resulting in enhanced transmission efficiency while preserving semantic information.

Despite the widespread adoption of vision sensors in edge applications, such as surveillance, the transmission of video data consumes substantial spectrum resources. Semantic communication (SC) offers a solution by extracting and compressing information at the semantic level, preserving the accuracy and relevance of transmitted data while significantly reducing the volume of transmitted information. However, traditional SC methods face inefficiencies due to the repeated transmission of static frames in edge videos, exacerbated by the absence of sensing capabilities, which results in spectrum inefficiency. To address this challenge, we propose a SC with computer vision sensing (SCCVS) framework for edge video transmission. The framework first introduces a compression ratio (CR) adaptive SC (CRSC) model, capable of adjusting CR based on whether the frames are static or dynamic, effectively conserving spectrum resources. Additionally, we implement an object detection and semantic segmentation models-enabled sensing (OSMS) scheme, which intelligently senses the changes in the scene and assesses the significance of each frame through in-context analysis. Hence, The OSMS scheme provides CR prompts to the CRSC model based on real-time sensing results. Moreover, both CRSC and OSMS are designed as lightweight models, ensuring compatibility with resource-constrained sensors commonly used in practical edge applications. Experimental simulations validate the effectiveness of the proposed SCCVS framework, demonstrating its ability to enhance transmission efficiency without sacrificing critical semantic information.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes