CVROSep 16, 2022

SeqOT: A Spatial-Temporal Transformer Network for Place Recognition Using Sequential LiDAR Data

arXiv:2209.07951v169 citationsh-index: 34Has Code
Originality Highly original
AI Analysis

This addresses loop closing and global localization for autonomous vehicles, representing a strong specific gain in LiDAR-based place recognition.

The paper tackled place recognition for autonomous vehicles using sequential LiDAR scans, proposing SeqOT, a transformer-based network that outperformed state-of-the-art methods on four datasets and operated online faster than the sensor frame rate.

Place recognition is an important component for autonomous vehicles to achieve loop closing or global localization. In this paper, we tackle the problem of place recognition based on sequential 3D LiDAR scans obtained by an onboard LiDAR sensor. We propose a transformer-based network named SeqOT to exploit the temporal and spatial information provided by sequential range images generated from the LiDAR data. It uses multi-scale transformers to generate a global descriptor for each sequence of LiDAR range images in an end-to-end fashion. During online operation, our SeqOT finds similar places by matching such descriptors between the current query sequence and those stored in the map. We evaluate our approach on four datasets collected with different types of LiDAR sensors in different environments. The experimental results show that our method outperforms the state-of-the-art LiDAR-based place recognition methods and generalizes well across different environments. Furthermore, our method operates online faster than the frame rate of the sensor. The implementation of our method is released as open source at: https://github.com/BIT-MJY/SeqOT.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes