CVJul 31, 2024

CAMAv2: A Vision-Centric Approach for Static Map Element Annotation

Shiyuan Chen, Jiaxin Zhang, Ruohong Mei, Yingfeng Cai, Haoran Yin, Tao Chen, Wei Sui, Cong Yang

arXiv:2407.21331v16.53 citationsh-index: 5Has Code

Originality Incremental advance

AI Analysis

This addresses the need for consistent and accurate training data for HD map construction in autonomous driving, representing an incremental improvement over prior methods.

The paper tackles the problem of generating high-quality 3D annotations for static map elements in autonomous driving datasets, achieving lower reprojection errors (e.g., 4.96 vs. 8.03 pixels) compared to existing manual annotations.

The recent development of online static map element (a.k.a. HD map) construction algorithms has raised a vast demand for data with ground truth annotations. However, available public datasets currently cannot provide high-quality training data regarding consistency and accuracy. For instance, the manual labelled (low efficiency) nuScenes still contains misalignment and inconsistency between the HD maps and images (e.g., around 8.03 pixels reprojection error on average). To this end, we present CAMAv2: a vision-centric approach for Consistent and Accurate Map Annotation. Without LiDAR inputs, our proposed framework can still generate high-quality 3D annotations of static map elements. Specifically, the annotation can achieve high reprojection accuracy across all surrounding cameras and is spatial-temporal consistent across the whole sequence. We apply our proposed framework to the popular nuScenes dataset to provide efficient and highly accurate annotations. Compared with the original nuScenes static map element, our CAMAv2 annotations achieve lower reprojection errors (e.g., 4.96 vs. 8.03 pixels). Models trained with annotations from CAMAv2 also achieve lower reprojection errors (e.g., 5.62 vs. 8.43 pixels).

View on arXiv PDF Code

Similar