CVGRLGMar 27, 2021

MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis

arXiv:2103.14910v3191 citationsHas Code
AI Analysis

This addresses the problem of generating realistic 3D views from limited inputs for applications in computer vision and graphics, representing an incremental improvement by combining existing techniques.

The paper tackles novel view synthesis and depth estimation from a single image by proposing MINE, a method that generalizes Multiplane Images with NeRF for continuous depth reconstruction, achieving state-of-the-art performance on datasets like RealEstate10K and competitive depth results without supervision.

In this paper, we propose MINE to perform novel view synthesis and depth estimation via dense 3D reconstruction from a single image. Our approach is a continuous depth generalization of the Multiplane Images (MPI) by introducing the NEural radiance fields (NeRF). Given a single image as input, MINE predicts a 4-channel image (RGB and volume density) at arbitrary depth values to jointly reconstruct the camera frustum and fill in occluded contents. The reconstructed and inpainted frustum can then be easily rendered into novel RGB or depth views using differentiable rendering. Extensive experiments on RealEstate10K, KITTI and Flowers Light Fields show that our MINE outperforms state-of-the-art by a large margin in novel view synthesis. We also achieve competitive results in depth estimation on iBims-1 and NYU-v2 without annotated depth supervision. Our source code is available at https://github.com/vincentfung13/MINE

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes