CVROSep 30, 2021

Semantic Dense Reconstruction with Consistent Scene Segments

arXiv:2109.14821v1
Originality Incremental advance
AI Analysis

This work addresses high-level scene understanding for robotics or AR/VR applications, but it appears incremental as it builds on existing reconstruction and segmentation techniques.

The paper tackles dense semantic 3D scene reconstruction from RGB-D sequences by proposing a method that integrates consistent 2D semantic segmentation with a novel semantic projection block, achieving state-of-the-art semantic prediction performance in experiments on public datasets.

In this paper, a method for dense semantic 3D scene reconstruction from an RGB-D sequence is proposed to solve high-level scene understanding tasks. First, each RGB-D pair is consistently segmented into 2D semantic maps based on a camera tracking backbone that propagates objects' labels with high probabilities from full scans to corresponding ones of partial views. Then a dense 3D mesh model of an unknown environment is incrementally generated from the input RGB-D sequence. Benefiting from 2D consistent semantic segments and the 3D model, a novel semantic projection block (SP-Block) is proposed to extract deep feature volumes from 2D segments of different views. Moreover, the semantic volumes are fused into deep volumes from a point cloud encoder to make the final semantic segmentation. Extensive experimental evaluations on public datasets show that our system achieves accurate 3D dense reconstruction and state-of-the-art semantic prediction performances simultaneously.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes