CVJul 10, 2022

2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds

arXiv:2207.04397v3326 citationsh-index: 43
Originality Incremental advance
AI Analysis

This addresses the limitation of fusion-based methods in autonomous driving by enabling improved segmentation with only point cloud inputs, though it is incremental as it builds on existing multi-modality fusion ideas.

The paper tackles the problem of semantic segmentation on LiDAR point clouds by proposing 2DPASS, a training scheme that uses 2D image priors to enhance 3D representation learning without requiring paired data during inference, achieving state-of-the-art results on SemanticKITTI and NuScenes benchmarks.

As camera and LiDAR sensors capture complementary information used in autonomous driving, great efforts have been made to develop semantic segmentation algorithms through multi-modality data fusion. However, fusion-based approaches require paired data, i.e., LiDAR point clouds and camera images with strict point-to-pixel mappings, as the inputs in both training and inference, which seriously hinders their application in practical scenarios. Thus, in this work, we propose the 2D Priors Assisted Semantic Segmentation (2DPASS), a general training scheme, to boost the representation learning on point clouds, by fully taking advantage of 2D images with rich appearance. In practice, by leveraging an auxiliary modal fusion and multi-scale fusion-to-single knowledge distillation (MSFSKD), 2DPASS acquires richer semantic and structural information from the multi-modal data, which are then online distilled to the pure 3D network. As a result, equipped with 2DPASS, our baseline shows significant improvement with only point cloud inputs. Specifically, it achieves the state-of-the-arts on two large-scale benchmarks (i.e. SemanticKITTI and NuScenes), including top-1 results in both single and multiple scan(s) competitions of SemanticKITTI.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes