CVNov 3, 2021

Panoptic 3D Scene Reconstruction From a Single RGB Image

arXiv:2111.02444v299 citations
AI Analysis

This addresses the need for comprehensive 3D perception for applications like robotics and augmented reality, representing an incremental advance by integrating existing tasks.

The paper tackles the problem of holistic 3D scene understanding from a single RGB image by unifying geometric reconstruction, semantic segmentation, and instance segmentation into panoptic 3D scene reconstruction, and demonstrates that this joint approach outperforms independent task methods.

Understanding 3D scenes from a single image is fundamental to a wide variety of tasks, such as for robotics, motion planning, or augmented reality. Existing works in 3D perception from a single RGB image tend to focus on geometric reconstruction only, or geometric reconstruction with semantic segmentation or instance segmentation. Inspired by 2D panoptic segmentation, we propose to unify the tasks of geometric reconstruction, 3D semantic segmentation, and 3D instance segmentation into the task of panoptic 3D scene reconstruction - from a single RGB image, predicting the complete geometric reconstruction of the scene in the camera frustum of the image, along with semantic and instance segmentations. We thus propose a new approach for holistic 3D scene understanding from a single RGB image which learns to lift and propagate 2D features from an input image to a 3D volumetric scene representation. We demonstrate that this holistic view of joint scene reconstruction, semantic, and instance segmentation is beneficial over treating the tasks independently, thus outperforming alternative approaches.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes