CVApr 19, 2024

Weakly Supervised LiDAR Semantic Segmentation via Scatter Image Annotation

arXiv:2404.12861v24 citationsh-index: 19IEEE transactions on multimedia
Originality Highly original
AI Analysis

This addresses the annotation bottleneck for LiDAR segmentation in autonomous driving, offering a significant reduction in labeling effort compared to existing methods.

The paper tackles the problem of efficient annotation for weakly supervised LiDAR semantic segmentation by proposing scatter image annotation, which reduces labeled points to less than 0.02% to achieve over 95% of fully-supervised performance.

Weakly supervised LiDAR semantic segmentation has made significant strides with limited labeled data. However, most existing methods focus on the network training under weak supervision, while efficient annotation strategies remain largely unexplored. To tackle this gap, we implement LiDAR semantic segmentation using scatter image annotation, effectively integrating an efficient annotation strategy with network training. Specifically, we propose employing scatter images to annotate LiDAR point clouds, combining a pre-trained optical flow estimation network with a foundation image segmentation model to rapidly propagate manual annotations into dense labels for both images and point clouds. Moreover, we propose ScatterNet, a network that includes three pivotal strategies to reduce the performance gap caused by such annotations. Firstly, it utilizes dense semantic labels as supervision for the image branch, alleviating the modality imbalance between point clouds and images. Secondly, an intermediate fusion branch is proposed to obtain multimodal texture and structural features. Lastly, a perception consistency loss is introduced to determine which information needs to be fused and which needs to be discarded during the fusion process. Extensive experiments on the nuScenes and SemanticKITTI datasets have demonstrated that our method requires less than 0.02% of the labeled points to achieve over 95% of the performance of fully-supervised methods. Notably, our labeled points are only 5% of those used in the most advanced weakly supervised methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes