CVAug 26, 2022

Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction

arXiv:2208.12697v5131 citationsh-index: 110Has Code
Originality Highly original
AI Analysis

This work addresses efficiency and accuracy bottlenecks in 3D surface reconstruction for computer vision applications, representing a strong incremental improvement over existing voxel-based methods.

The paper tackles the problem of slow and inaccurate neural surface reconstruction from multi-view images by introducing Voxurf, a voxel-based method that achieves a 20x training speedup and higher reconstruction quality on the DTU benchmark compared to previous fully implicit approaches.

Neural surface reconstruction aims to reconstruct accurate 3D surfaces based on multi-view images. Previous methods based on neural volume rendering mostly train a fully implicit model with MLPs, which typically require hours of training for a single scene. Recent efforts explore the explicit volumetric representation to accelerate the optimization via memorizing significant information with learnable voxel grids. However, existing voxel-based methods often struggle in reconstructing fine-grained geometry, even when combined with an SDF-based volume rendering scheme. We reveal that this is because 1) the voxel grids tend to break the color-geometry dependency that facilitates fine-geometry learning, and 2) the under-constrained voxel grids lack spatial coherence and are vulnerable to local minima. In this work, we present Voxurf, a voxel-based surface reconstruction approach that is both efficient and accurate. Voxurf addresses the aforementioned issues via several key designs, including 1) a two-stage training procedure that attains a coherent coarse shape and recovers fine details successively, 2) a dual color network that maintains color-geometry dependency, and 3) a hierarchical geometry feature to encourage information propagation across voxels. Extensive experiments show that Voxurf achieves high efficiency and high quality at the same time. On the DTU benchmark, Voxurf achieves higher reconstruction quality with a 20x training speedup compared to previous fully implicit methods. Our code is available at https://github.com/wutong16/Voxurf.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes