CVROOct 18, 2022

Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation

Georgia Tech
arXiv:2210.10108v273 citationsh-index: 51
Originality Incremental advance
AI Analysis

This addresses robust 6-DoF pose estimation for computer vision applications, representing an incremental improvement through parallelization and loss function enhancements.

The paper tackles camera pose estimation from a single RGB image by optimizing a fast NeRF model to match observed pixels, achieving improved generalization and robustness on synthetic and real-world benchmarks.

We present a parallelized optimization method based on fast Neural Radiance Fields (NeRF) for estimating 6-DoF pose of a camera with respect to an object or scene. Given a single observed RGB image of the target, we can predict the translation and rotation of the camera by minimizing the residual between pixels rendered from a fast NeRF model and pixels in the observed image. We integrate a momentum-based camera extrinsic optimization procedure into Instant Neural Graphics Primitives, a recent exceptionally fast NeRF implementation. By introducing parallel Monte Carlo sampling into the pose estimation task, our method overcomes local minima and improves efficiency in a more extensive search space. We also show the importance of adopting a more robust pixel-based loss function to reduce error. Experiments demonstrate that our method can achieve improved generalization and robustness on both synthetic and real-world benchmarks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes