HDMapNet: An Online HD Map Construction and Evaluation Framework
This addresses the scalability issue of manual map annotation in autonomous driving, though it is an incremental advance in sensor fusion for map learning.
The paper tackles the problem of constructing high-definition semantic maps for autonomous driving by introducing HDMapNet, a method that dynamically builds local maps from onboard sensors, achieving over 50% improvement in metrics compared to baselines on the nuScenes dataset.
Constructing HD semantic maps is a central component of autonomous driving. However, traditional pipelines require a vast amount of human efforts and resources in annotating and maintaining the semantics in the map, which limits its scalability. In this paper, we introduce the problem of HD semantic map learning, which dynamically constructs the local semantics based on onboard sensor observations. Meanwhile, we introduce a semantic map learning method, dubbed HDMapNet. HDMapNet encodes image features from surrounding cameras and/or point clouds from LiDAR, and predicts vectorized map elements in the bird's-eye view. We benchmark HDMapNet on nuScenes dataset and show that in all settings, it performs better than baseline methods. Of note, our camera-LiDAR fusion-based HDMapNet outperforms existing methods by more than 50% in all metrics. In addition, we develop semantic-level and instance-level metrics to evaluate the map learning performance. Finally, we showcase our method is capable of predicting a locally consistent map. By introducing the method and metrics, we invite the community to study this novel map learning problem.