ROAIDec 2, 2024

A Semantic Communication System for Real-time 3D Reconstruction Tasks

arXiv:2412.01191v12 citationsh-index: 3ICCIS
Originality Incremental advance
AI Analysis

This enables affordable, real-time semantic mapping for robotics and scene understanding applications, though it appears incremental as it builds on existing cloud-edge and semantic communication concepts.

The paper tackles the problem of real-time 3D semantic mapping on resource-limited mobile devices by proposing a semantic communication system, achieving less than 0.1% error in mapping accuracy and update times under 1 second.

3D semantic maps have played an increasingly important role in high-precision robot localization and scene understanding. However, real-time construction of semantic maps requires mobile edge devices with extremely high computing power, which are expensive and limit the widespread application of semantic mapping. In order to address this limitation, inspired by cloud-edge collaborative computing and the high transmission efficiency of semantic communication, this paper proposes a method to achieve real-time semantic mapping tasks with limited-resource mobile devices. Specifically, we design an encoding-decoding semantic communication framework for real-time semantic mapping tasks under limited-resource situations. In addition, considering the impact of different channel conditions on communication, this paper designs a module based on the attention mechanism to achieve stable data transmission under various channel conditions. In terms of simulation experiments, based on the TUM dataset, it was verified that the system has an error of less than 0.1% compared to the groundtruth in mapping and localization accuracy and is superior to some novel semantic communication algorithms in real-time performance and channel adaptation. Besides, we implement a prototype system to verify the effectiveness of the proposed framework and designed module in real indoor scenarios. The results show that our system can complete real-time semantic mapping tasks for common indoor objects (chairs, computers, people, etc.) with a limited-resource device, and the mapping update time is less than 1 second.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes