CVMar 29, 2021

MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo

arXiv:2103.15595v21044 citations
AI Analysis

This enables efficient 3D scene reconstruction for applications like VR/AR, though it builds incrementally on multi-view stereo and neural rendering techniques.

The paper tackles the problem of reconstructing neural radiance fields for view synthesis from only three input views, achieving fast network inference and generalizing across different scenes while outperforming concurrent works.

We present MVSNeRF, a novel neural rendering approach that can efficiently reconstruct neural radiance fields for view synthesis. Unlike prior works on neural radiance fields that consider per-scene optimization on densely captured images, we propose a generic deep neural network that can reconstruct radiance fields from only three nearby input views via fast network inference. Our approach leverages plane-swept cost volumes (widely used in multi-view stereo) for geometry-aware scene reasoning, and combines this with physically based volume rendering for neural radiance field reconstruction. We train our network on real objects in the DTU dataset, and test it on three different datasets to evaluate its effectiveness and generalizability. Our approach can generalize across scenes (even indoor scenes, completely different from our training scenes of objects) and generate realistic view synthesis results using only three input images, significantly outperforming concurrent works on generalizable radiance field reconstruction. Moreover, if dense images are captured, our estimated radiance field representation can be easily fine-tuned; this leads to fast per-scene reconstruction with higher rendering quality and substantially less optimization time than NeRF.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes