CVMar 21, 2024

SurroundSDF: Implicit 3D Scene Understanding Based on Signed Distance Field

arXiv:2403.14366v111 citationsh-index: 8CVPR
Originality Incremental advance
AI Analysis

This work addresses the problem of precise 3D environment perception for autonomous driving systems, representing an incremental improvement over existing object-free methods.

The paper tackles the challenge of achieving continuous and accurate 3D scene understanding for autonomous driving by proposing SurroundSDF, a method that implicitly predicts signed distance and semantic fields from surround images, achieving state-of-the-art results on occupancy prediction and 3D scene reconstruction tasks on the nuScenes dataset.

Vision-centric 3D environment understanding is both vital and challenging for autonomous driving systems. Recently, object-free methods have attracted considerable attention. Such methods perceive the world by predicting the semantics of discrete voxel grids but fail to construct continuous and accurate obstacle surfaces. To this end, in this paper, we propose SurroundSDF to implicitly predict the signed distance field (SDF) and semantic field for the continuous perception from surround images. Specifically, we introduce a query-based approach and utilize SDF constrained by the Eikonal formulation to accurately describe the surfaces of obstacles. Furthermore, considering the absence of precise SDF ground truth, we propose a novel weakly supervised paradigm for SDF, referred to as the Sandwich Eikonal formulation, which emphasizes applying correct and dense constraints on both sides of the surface, thereby enhancing the perceptual accuracy of the surface. Experiments suggest that our method achieves SOTA for both occupancy prediction and 3D scene reconstruction tasks on the nuScenes dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes