CVMay 13, 2020

Self-Supervised Deep Visual Odometry with Online Adaptation

arXiv:2005.06136v179 citations
AI Analysis

This addresses the robustness issue for practical applications of VO in changing environments, representing an incremental improvement over existing self-supervised methods.

The paper tackles the problem of performance degradation in self-supervised visual odometry (VO) when applied to scenes different from training data, proposing an online meta-learning algorithm with convLSTM and feature alignment that consistently outperforms state-of-the-art baselines on unseen environments.

Self-supervised VO methods have shown great success in jointly estimating camera pose and depth from videos. However, like most data-driven methods, existing VO networks suffer from a notable decrease in performance when confronted with scenes different from the training data, which makes them unsuitable for practical applications. In this paper, we propose an online meta-learning algorithm to enable VO networks to continuously adapt to new environments in a self-supervised manner. The proposed method utilizes convolutional long short-term memory (convLSTM) to aggregate rich spatial-temporal information in the past. The network is able to memorize and learn from its past experience for better estimation and fast adaptation to the current frame. When running VO in the open world, in order to deal with the changing environment, we propose an online feature alignment method by aligning feature distributions at different time. Our VO network is able to seamlessly adapt to different environments. Extensive experiments on unseen outdoor scenes, virtual to real world and outdoor to indoor environments demonstrate that our method consistently outperforms state-of-the-art self-supervised VO baselines considerably.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes