CVMar 13, 2024

NeRF-Supervised Feature Point Detection and Description

arXiv:2403.08156v32 citationsh-index: 1ECCV Workshops
Originality Incremental advance
AI Analysis

This addresses a bottleneck in computer vision applications like SLAM and 3D reconstruction by improving training data realism, though it is incremental as it adapts existing methods rather than introducing new ones.

The paper tackles the problem of limited generalizability in learning-based feature point detection and description by using Neural Radiance Fields (NeRFs) to generate realistic multi-view training data, achieving competitive or superior performance on standard benchmarks for tasks like relative pose estimation while requiring significantly less training data and time.

Feature point detection and description is the backbone for various computer vision applications, such as Structure-from-Motion, visual SLAM, and visual place recognition. While learning-based methods have surpassed traditional handcrafted techniques, their training often relies on simplistic homography-based simulations of multi-view perspectives, limiting model generalisability. This paper presents a novel approach leveraging Neural Radiance Fields (NeRFs) to generate a diverse and realistic dataset consisting of indoor and outdoor scenes. Our proposed methodology adapts state-of-the-art feature detectors and descriptors for training on multi-view NeRF-synthesised data, with supervision achieved through perspective projective geometry. Experiments demonstrate that the proposed methodology achieves competitive or superior performance on standard benchmarks for relative pose estimation, point cloud registration, and homography estimation while requiring significantly less training data and time compared to existing approaches.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes