Pole-based Vehicle Localization with Vector Maps: A Camera-LiDAR Comparative Study
This work addresses localization challenges for autonomous vehicles in GNSS-denied urban areas, presenting an incremental improvement by integrating cost-efficient camera systems.
The paper tackles vehicle localization in urban environments by using pole features from vector maps, comparing LiDAR and camera-based methods, and demonstrates that a lightweight neural network for camera-based detection achieves high accuracy in open road conditions.
For autonomous navigation, accurate localization with respect to a map is needed. In urban environments, infrastructure such as buildings or bridges cause major difficulties to Global Navigation Satellite Systems (GNSS) and, despite advances in inertial navigation, it is necessary to support them with other sources of exteroceptive information. In road environments, many common furniture such as traffic signs, traffic lights and street lights take the form of poles. By georeferencing these features in vector maps, they can be used within a localization filter that includes a detection pipeline and a data association method. Poles, having discriminative vertical structures, can be extracted from 3D geometric information using LiDAR sensors. Alternatively, deep neural networks can be employed to detect them from monocular cameras. The lack of depth information induces challenges in associating camera detections with map features. Yet, multi-camera integration provides a cost-efficient solution. This paper quantitatively evaluates the efficacy of these approaches in terms of localization. It introduces a real-time method for camera-based pole detection using a lightweight neural network trained on automatically annotated images. The proposed methods' efficiency is assessed on a challenging sequence with a vector map. The results highlight the high accuracy of the vision-based approach in open road conditions.