CVApr 24, 2023

Explicit Correspondence Matching for Generalizable Neural Radiance Fields

ByteDance
arXiv:2304.12294v251 citationsh-index: 38
Originality Highly original
AI Analysis

This addresses the challenge of efficient 3D scene reconstruction for computer vision applications, though it is incremental by improving on existing NeRF methods.

The paper tackles the problem of generalizing Neural Radiance Fields (NeRF) to new scenarios with few source views by explicitly modeling correspondence matching, achieving state-of-the-art results in novel view synthesis.

We present a new generalizable NeRF method that is able to directly generalize to new unseen scenarios and perform novel view synthesis with as few as two source views. The key to our approach lies in the explicitly modeled correspondence matching information, so as to provide the geometry prior to the prediction of NeRF color and density for volume rendering. The explicit correspondence matching is quantified with the cosine similarity between image features sampled at the 2D projections of a 3D point on different views, which is able to provide reliable cues about the surface geometry. Unlike previous methods where image features are extracted independently for each view, we consider modeling the cross-view interactions via Transformer cross-attention, which greatly improves the feature matching quality. Our method achieves state-of-the-art results on different evaluation settings, with the experiments showing a strong correlation between our learned cosine feature similarity and volume density, demonstrating the effectiveness and superiority of our proposed method. The code and model are on our project page: https://donydchen.github.io/matchnerf

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes