ROSep 7, 2017

CNN for Very Fast Ground Segmentation in Velodyne LiDAR Data

arXiv:1709.02128v162 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of fast ground segmentation for autonomous vehicles or robotics, but it is incremental as it builds on existing CNN approaches with a new encoding.

The paper tackles ground segmentation in Velodyne LiDAR point clouds by proposing a CNN-based method that encodes sparse 3D data into a multi-channel 2D signal, resulting in significant speed improvements and minor accuracy gains over the state-of-the-art.

This paper presents a novel method for ground segmentation in Velodyne point clouds. We propose an encoding of sparse 3D data from the Velodyne sensor suitable for training a convolutional neural network (CNN). This general purpose approach is used for segmentation of the sparse point cloud into ground and non-ground points. The LiDAR data are represented as a multi-channel 2D signal where the horizontal axis corresponds to the rotation angle and the vertical axis the indexes channels (i.e. laser beams). Multiple topologies of relatively shallow CNNs (i.e. 3-5 convolutional layers) are trained and evaluated using a manually annotated dataset we prepared. The results show significant improvement of performance over the state-of-the-art method by Zhang et al. in terms of speed and also minor improvements in terms of accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes