CVAug 24, 2019

Efficient Learning on Point Clouds with Basis Point Sets

Sergey Prokudin, Christoph Lassner, Javier Romero

arXiv:1908.09186v124.2159 citationsh-index: 41Has Code

Originality Incremental advance

AI Analysis

This addresses the computational inefficiency and high parameter counts in existing deep learning methods for point clouds, benefiting computer vision applications like 3D scanning and scene analysis.

The paper tackles the problem of efficiently processing unordered point clouds for machine learning by proposing basis point sets (BPS), a residual representation that matches PointNet's performance on shape classification with three orders of magnitude fewer floating-point operations and enables real-time, single-pass high-resolution mesh registration.

With the increased availability of 3D scanning technology, point clouds are moving into the focus of computer vision as a rich representation of everyday scenes. However, they are hard to handle for machine learning algorithms due to their unordered structure. One common approach is to apply occupancy grid mapping, which dramatically increases the amount of data stored and at the same time loses details through discretization. Recently, deep learning models were proposed to handle point clouds directly and achieve input permutation invariance. However, these architectures often use an increased number of parameters and are computationally inefficient. In this work, we propose basis point sets (BPS) as a highly efficient and fully general way to process point clouds with machine learning algorithms. The basis point set representation is a residual representation that can be computed efficiently and can be used with standard neural network architectures and other machine learning algorithms. Using the proposed representation as the input to a simple fully connected network allows us to match the performance of PointNet on a shape classification task while using three orders of magnitude less floating-point operations. In a second experiment, we show how the proposed representation can be used for registering high-resolution meshes to noisy 3D scans. Here, we present the first method for single-pass high-resolution mesh registration, avoiding time-consuming per-scan optimization and allowing real-time execution.

View on arXiv PDF Code

Similar