CVJun 13, 2022

ATDN vSLAM: An all-through Deep Learning-Based Solution for Visual Simultaneous Localization and Mapping

arXiv:2206.05963v32 citationsh-index: 5
Originality Incremental advance
AI Analysis

This provides an efficient, low-latency solution for autonomous driving applications, such as aiding database creation and vehicle control.

The paper tackles visual Simultaneous Localization and Mapping (vSLAM) by proposing an all-through deep learning-based architecture, achieving 4.4% translation and 0.0176 deg/m rotational error on a KITTI dataset subset.

In this paper, a novel solution is introduced for visual Simultaneous Localization and Mapping (vSLAM) that is built up of Deep Learning components. The proposed architecture is a highly modular framework in which each component offers state of the art results in their respective fields of vision-based deep learning solutions. The paper shows that with the synergic integration of these individual building blocks, a functioning and efficient all-through deep neural (ATDN) vSLAM system can be created. The Embedding Distance Loss function is introduced and using it the ATDN architecture is trained. The resulting system managed to achieve 4.4% translation and 0.0176 deg/m rotational error on a subset of the KITTI dataset. The proposed architecture can be used for efficient and low-latency autonomous driving (AD) aiding database creation as well as a basis for autonomous vehicle (AV) control.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes