CVJul 31, 2024

CAMAv2: A Vision-Centric Approach for Static Map Element Annotation

arXiv:2407.21331v13 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses the need for consistent and accurate training data for HD map construction in autonomous driving, representing an incremental improvement over prior methods.

The paper tackles the problem of generating high-quality 3D annotations for static map elements in autonomous driving datasets, achieving lower reprojection errors (e.g., 4.96 vs. 8.03 pixels) compared to existing manual annotations.

The recent development of online static map element (a.k.a. HD map) construction algorithms has raised a vast demand for data with ground truth annotations. However, available public datasets currently cannot provide high-quality training data regarding consistency and accuracy. For instance, the manual labelled (low efficiency) nuScenes still contains misalignment and inconsistency between the HD maps and images (e.g., around 8.03 pixels reprojection error on average). To this end, we present CAMAv2: a vision-centric approach for Consistent and Accurate Map Annotation. Without LiDAR inputs, our proposed framework can still generate high-quality 3D annotations of static map elements. Specifically, the annotation can achieve high reprojection accuracy across all surrounding cameras and is spatial-temporal consistent across the whole sequence. We apply our proposed framework to the popular nuScenes dataset to provide efficient and highly accurate annotations. Compared with the original nuScenes static map element, our CAMAv2 annotations achieve lower reprojection errors (e.g., 4.96 vs. 8.03 pixels). Models trained with annotations from CAMAv2 also achieve lower reprojection errors (e.g., 5.62 vs. 8.43 pixels).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes