CVROAug 30, 2024

RING#: PR-by-PE Global Localization with Roto-translation Equivariant Gram Learning

arXiv:2409.00206v210 citationsh-index: 14
Originality Highly original
AI Analysis

This addresses localization failures in scenarios with viewpoint or appearance changes for autonomous systems, representing a new paradigm rather than an incremental improvement.

The paper tackles the problem of global localization in autonomous driving and robotics by introducing a PR-by-PE paradigm that bypasses separate place recognition, directly deriving it from pose estimation, and shows that RING# outperforms state-of-the-art methods on NCLT and Oxford datasets.

Global localization using onboard perception sensors, such as cameras and LiDARs, is crucial in autonomous driving and robotics applications when GPS signals are unreliable. Most approaches achieve global localization by sequential place recognition (PR) and pose estimation (PE). Some methods train separate models for each task, while others employ a single model with dual heads, trained jointly with separate task-specific losses. However, the accuracy of localization heavily depends on the success of place recognition, which often fails in scenarios with significant changes in viewpoint or environmental appearance. Consequently, this renders the final pose estimation of localization ineffective. To address this, we introduce a new paradigm, PR-by-PE localization, which bypasses the need for separate place recognition by directly deriving it from pose estimation. We propose RING#, an end-to-end PR-by-PE localization network that operates in the bird's-eye-view (BEV) space, compatible with both vision and LiDAR sensors. RING# incorporates a novel design that learns two equivariant representations from BEV features, enabling globally convergent and computationally efficient pose estimation. Comprehensive experiments on the NCLT and Oxford datasets show that RING# outperforms state-of-the-art methods in both vision and LiDAR modalities, validating the effectiveness of the proposed approach. The code will be publicly released.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes