CVDec 14, 2022

MAELi: Masked Autoencoder for Large-Scale LiDAR Point Clouds

arXiv:2212.07207v528 citationsh-index: 88
Originality Incremental advance
AI Analysis

This addresses the challenge of tedious 3D annotation for autonomous driving perception tasks, offering a novel pre-training framework that is incremental in leveraging existing autoencoder concepts for LiDAR data.

The paper tackles the problem of large blind spots in LiDAR point clouds by proposing MAELi, a masked autoencoder for self-supervised representation learning that reduces the need for 3D annotations, achieving state-of-the-art performance in 3D object detection and semantic segmentation for autonomous driving.

The sensing process of large-scale LiDAR point clouds inevitably causes large blind spots, i.e. regions not visible to the sensor. We demonstrate how these inherent sampling properties can be effectively utilized for self-supervised representation learning by designing a highly effective pre-training framework that considerably reduces the need for tedious 3D annotations to train state-of-the-art object detectors. Our Masked AutoEncoder for LiDAR point clouds (MAELi) intuitively leverages the sparsity of LiDAR point clouds in both the encoder and decoder during reconstruction. This results in more expressive and useful initialization, which can be directly applied to downstream perception tasks, such as 3D object detection or semantic segmentation for autonomous driving. In a novel reconstruction approach, MAELi distinguishes between empty and occluded space and employs a new masking strategy that targets the LiDAR's inherent spherical projection. Thereby, without any ground truth whatsoever and trained on single frames only, MAELi obtains an understanding of the underlying 3D scene geometry and semantics. To demonstrate the potential of MAELi, we pre-train backbones in an end-to-end manner and show the effectiveness of our unsupervised pre-trained weights on the tasks of 3D object detection and semantic segmentation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes