CVJun 4, 2022

Nerfels: Renderable Neural Codes for Improved Camera Pose Estimation

MIT
arXiv:2206.01916v113 citationsh-index: 31
Originality Incremental advance
AI Analysis

This work addresses camera pose estimation for computer vision applications, offering an incremental improvement by enhancing existing pose estimators with a novel scene representation.

The paper tackles camera pose estimation by combining keypoint-based optimization with an invertible neural rendering mechanism using Nerfels, a locally dense but globally sparse 3D scene representation, resulting in improved performance on ScanNet for wide camera baseline scenarios.

This paper presents a framework that combines traditional keypoint-based camera pose optimization with an invertible neural rendering mechanism. Our proposed 3D scene representation, Nerfels, is locally dense yet globally sparse. As opposed to existing invertible neural rendering systems which overfit a model to the entire scene, we adopt a feature-driven approach for representing scene-agnostic, local 3D patches with renderable codes. By modelling a scene only where local features are detected, our framework effectively generalizes to unseen local regions in the scene via an optimizable code conditioning mechanism in the neural renderer, all while maintaining the low memory footprint of a sparse 3D map representation. Our model can be incorporated to existing state-of-the-art hand-crafted and learned local feature pose estimators, yielding improved performance when evaluating on ScanNet for wide camera baseline scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes