CVLGRONov 17, 2020

Exploring Self-Attention for Visual Odometry

arXiv:2011.08634v18 citations
AI Analysis

This work addresses the problem of unreliable motion information in visual odometry for researchers and practitioners, particularly concerning dynamic objects and texture-less surfaces, by proposing an incremental improvement through self-attention.

This paper explores the effectiveness of self-attention mechanisms in visual odometry networks, which traditionally rely on pretrained optical flow networks. The authors demonstrate that incorporating self-attention leads to better feature extraction and improved odometry performance compared to networks without such structures.

Visual odometry networks commonly use pretrained optical flow networks in order to derive the ego-motion between consecutive frames. The features extracted by these networks represent the motion of all the pixels between frames. However, due to the existence of dynamic objects and texture-less surfaces in the scene, the motion information for every image region might not be reliable for inferring odometry due to the ineffectiveness of dynamic objects in derivation of the incremental changes in position. Recent works in this area lack attention mechanisms in their structures to facilitate dynamic reweighing of the feature maps for extracting more refined egomotion information. In this paper, we explore the effectiveness of self-attention in visual odometry. We report qualitative and quantitative results against the SOTA methods. Furthermore, saliency-based studies alongside specially designed experiments are utilized to investigate the effect of self-attention on VO. Our experiments show that using self-attention allows for the extraction of better features while achieving a better odometry performance compared to networks that lack such structures.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes