ROCVJul 28, 2024

Solving Short-Term Relocalization Problems In Monocular Keyframe Visual SLAM Using Spatial And Semantic Data

arXiv:2407.19518v12 citationsh-index: 3
Originality Incremental advance
AI Analysis

This work addresses the need for reliable navigation and precise behaviors in mobile robots operating in GPS-denied environments, representing an incremental improvement over existing methods.

The paper tackles the problem of short-term relocalization in monocular keyframe visual SLAM for mobile robots by introducing a novel multimodal keyframe descriptor that combines semantic and spatial data, resulting in accurate pose recovery as demonstrated on indoor GPS-denied datasets compared to a bag-of-words approach.

In Monocular Keyframe Visual Simultaneous Localization and Mapping (MKVSLAM) frameworks, when incremental position tracking fails, global pose has to be recovered in a short-time window, also known as short-term relocalization. This capability is crucial for mobile robots to have reliable navigation, build accurate maps, and have precise behaviors around human collaborators. This paper focuses on the development of robust short-term relocalization capabilities for mobile robots using a monocular camera system. A novel multimodal keyframe descriptor is introduced, that contains semantic information of objects detected in the environment and the spatial information of the camera. Using this descriptor, a new Keyframe-based Place Recognition (KPR) method is proposed that is formulated as a multi-stage keyframe filtering algorithm, leading to a new relocalization pipeline for MKVSLAM systems. The proposed approach is evaluated over several indoor GPS denied datasets and demonstrates accurate pose recovery, in comparison to a bag-of-words approach.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes