CVMay 17, 2023

MultiPlaneNeRF: Neural Radiance Field with Non-Trainable Representation

arXiv:2305.10579v36 citations
Originality Incremental advance
AI Analysis

This addresses the problem of slow training and lack of generalization in 3D reconstruction for researchers and practitioners in computer vision.

The paper tackles the limitations of NeRF, such as per-object training and poor generalization, by introducing MultiPlaneNeRF, which uses non-trainable representations from 2D images to achieve results comparable to state-of-the-art models in view synthesis.

NeRF is a popular model that efficiently represents 3D objects from 2D images. However, vanilla NeRF has some important limitations. NeRF must be trained on each object separately. The training time is long since we encode the object's shape and color in neural network weights. Moreover, NeRF does not generalize well to unseen data. In this paper, we present MultiPlaneNeRF -- a model that simultaneously solves the above problems. Our model works directly on 2D images. We project 3D points on 2D images to produce non-trainable representations. The projection step is not parametrized and a very shallow decoder can efficiently process the representation. Furthermore, we can train MultiPlaneNeRF on a large data set and force our implicit decoder to generalize across many objects. Consequently, we can only replace the 2D images (without additional training) to produce a NeRF representation of the new object. In the experimental section, we demonstrate that MultiPlaneNeRF achieves results comparable to state-of-the-art models for synthesizing new views and has generalization properties. Additionally, MultiPlane decoder can be used as a component in large generative models like GANs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes