CV ROOct 18, 2022

Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation

Yunzhi Lin, Thomas Müller, Jonathan Tremblay, Bowen Wen, Stephen Tyree, Alex Evans, Patricio A. Vela, Stan Birchfield

Georgia Tech

arXiv:2210.10108v223.873 citationsh-index: 51Has Code

Originality Incremental advance

AI Analysis

This addresses robust 6-DoF pose estimation for computer vision applications, representing an incremental improvement through parallelization and loss function enhancements.

The paper tackles camera pose estimation from a single RGB image by optimizing a fast NeRF model to match observed pixels, achieving improved generalization and robustness on synthetic and real-world benchmarks.

We present a parallelized optimization method based on fast Neural Radiance Fields (NeRF) for estimating 6-DoF pose of a camera with respect to an object or scene. Given a single observed RGB image of the target, we can predict the translation and rotation of the camera by minimizing the residual between pixels rendered from a fast NeRF model and pixels in the observed image. We integrate a momentum-based camera extrinsic optimization procedure into Instant Neural Graphics Primitives, a recent exceptionally fast NeRF implementation. By introducing parallel Monte Carlo sampling into the pose estimation task, our method overcomes local minima and improves efficiency in a more extensive search space. We also show the importance of adopting a more robust pixel-based loss function to reduce error. Experiments demonstrate that our method can achieve improved generalization and robustness on both synthetic and real-world benchmarks.

View on arXiv PDF Code

Similar