ROCVSep 21, 2021

Scale-aware direct monocular odometry

arXiv:2109.10077v26 citations
AI Analysis

This work addresses scale drift in monocular SLAM for robotics and autonomous vehicles, representing a strong incremental improvement over existing methods.

The authors tackled scale drift in monocular visual odometry by developing a framework that integrates depth predictions from neural networks into a tightly-coupled optimization, resulting in a system that outperforms classic monocular SLAM by 5 to 9 times in precision on the KITTI dataset.

We present a generic framework for scale-aware direct monocular odometry based on depth prediction from a deep neural network. In contrast with previous methods where depth information is only partially exploited, we formulate a novel depth prediction residual which allows us to incorporate multi-view depth information. In addition, we propose to use a truncated robust cost function which prevents considering inconsistent depth estimations. The photometric and depth-prediction measurements are integrated into a tightly-coupled optimization leading to a scale-aware monocular system which does not accumulate scale drift. Our proposal does not particularize for a concrete neural network, being able to work along with the vast majority of the existing depth prediction solutions. We demonstrate the validity and generality of our proposal evaluating it on the KITTI odometry dataset, using two publicly available neural networks and comparing it with similar approaches and the state-of-the-art for monocular and stereo SLAM. Experiments show that our proposal largely outperforms classic monocular SLAM, being 5 to 9 times more precise, beating similar approaches and having an accuracy which is closer to that of stereo systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes