CVMay 22, 2023

Fast Monocular Scene Reconstruction with Global-Sparse Local-Dense Grids

arXiv:2305.13220v113 citations
Originality Highly original
AI Analysis

This work addresses speed limitations for augmented reality and robotics developers, offering a significant performance improvement over existing neural implicit methods.

The paper tackles the problem of slow training and rendering in monocular indoor scene reconstruction by replacing MLPs with a sparse voxel block grid representation, achieving 10x faster training and 100x faster rendering with comparable accuracy to state-of-the-art methods.

Indoor scene reconstruction from monocular images has long been sought after by augmented reality and robotics developers. Recent advances in neural field representations and monocular priors have led to remarkable results in scene-level surface reconstructions. The reliance on Multilayer Perceptrons (MLP), however, significantly limits speed in training and rendering. In this work, we propose to directly use signed distance function (SDF) in sparse voxel block grids for fast and accurate scene reconstruction without MLPs. Our globally sparse and locally dense data structure exploits surfaces' spatial sparsity, enables cache-friendly queries, and allows direct extensions to multi-modal data such as color and semantic labels. To apply this representation to monocular scene reconstruction, we develop a scale calibration algorithm for fast geometric initialization from monocular depth priors. We apply differentiable volume rendering from this initialization to refine details with fast convergence. We also introduce efficient high-dimensional Continuous Random Fields (CRFs) to further exploit the semantic-geometry consistency between scene objects. Experiments show that our approach is 10x faster in training and 100x faster in rendering while achieving comparable accuracy to state-of-the-art neural implicit methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes