CVJan 17, 2020

Unsupervised Learning of Camera Pose with Compositional Re-estimation

arXiv:2001.06479v17 citations
AI Analysis

This work addresses camera pose estimation for applications like robotics and autonomous driving, presenting an incremental improvement over existing unsupervised methods.

The paper tackles unsupervised camera pose estimation from video sequences by proposing a compositional re-estimation process that iteratively refines camera motion using estimated depth maps, resulting in improved performance that outperforms state-of-the-art methods on the KITTI benchmark dataset.

We consider the problem of unsupervised camera pose estimation. Given an input video sequence, our goal is to estimate the camera pose (i.e. the camera motion) between consecutive frames. Traditionally, this problem is tackled by placing strict constraints on the transformation vector or by incorporating optical flow through a complex pipeline. We propose an alternative approach that utilizes a compositional re-estimation process for camera pose estimation. Given an input, we first estimate a depth map. Our method then iteratively estimates the camera motion based on the estimated depth map. Our approach significantly improves the predicted camera motion both quantitatively and visually. Furthermore, the re-estimation resolves the problem of out-of-boundaries pixels in a novel and simple way. Another advantage of our approach is that it is adaptable to other camera pose estimation approaches. Experimental analysis on KITTI benchmark dataset demonstrates that our method outperforms existing state-of-the-art approaches in unsupervised camera ego-motion estimation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes