CVMay 9, 2017

Deep Projective 3D Semantic Segmentation

Felix Järemo Lawin, Martin Danelljan, Patrik Tosteberg, Goutam Bhat, Fahad Shahbaz Khan, Michael Felsberg

arXiv:1705.03428v123.6356 citations

Originality Highly original

AI Analysis

This addresses the challenge of limited performance and high memory consumption in 3D-CNN methods for 3D segmentation, benefiting applications like autonomous driving and robotics.

The paper tackles the problem of 3D point cloud semantic segmentation by projecting point clouds onto 2D images, using a 2D-CNN for segmentation, and re-projecting results back to 3D, achieving a 7.9% relative gain over the previous state-of-the-art on the Semantic3D dataset.

Semantic segmentation of 3D point clouds is a challenging problem with numerous real-world applications. While deep learning has revolutionized the field of image semantic segmentation, its impact on point cloud data has been limited so far. Recent attempts, based on 3D deep learning approaches (3D-CNNs), have achieved below-expected results. Such methods require voxelizations of the underlying point cloud data, leading to decreased spatial resolution and increased memory consumption. Additionally, 3D-CNNs greatly suffer from the limited availability of annotated datasets. In this paper, we propose an alternative framework that avoids the limitations of 3D-CNNs. Instead of directly solving the problem in 3D, we first project the point cloud onto a set of synthetic 2D-images. These images are then used as input to a 2D-CNN, designed for semantic segmentation. Finally, the obtained prediction scores are re-projected to the point cloud to obtain the segmentation results. We further investigate the impact of multiple modalities, such as color, depth and surface normals, in a multi-stream network architecture. Experiments are performed on the recent Semantic3D dataset. Our approach sets a new state-of-the-art by achieving a relative gain of 7.9 %, compared to the previous best approach.

View on arXiv PDF

Similar