CVDec 11, 2018

Deep RBFNet: Point Cloud Feature Learning using Radial Basis Functions

arXiv:1812.04302v228 citations
Originality Incremental advance
AI Analysis

This work addresses overfitting and computational inefficiency in 3D object recognition for applications like portable devices, though it is incremental as it builds on existing point cloud frameworks.

The paper tackles the problem of high parameter count and overfitting in point cloud-based 3D object recognition by proposing Deep RBFNet, which uses Radial Basis Function kernels to model spatial distributions and reduce parameters. It outperforms PointNet++ in classification accuracy and enables faster training and deployment on resource-limited devices.

Three-dimensional object recognition has recently achieved great progress thanks to the development of effective point cloud-based learning frameworks, such as PointNet and its extensions. However, existing methods rely heavily on fully connected layers, which introduce a significant amount of parameters, making the network harder to train and prone to overfitting problems. In this paper, we propose a simple yet effective framework for point set feature learning by leveraging a nonlinear activation layer encoded by Radial Basis Function (RBF) kernels. Unlike PointNet variants, that fail to recognize local point patterns, our approach explicitly models the spatial distribution of point clouds by aggregating features from sparsely distributed RBF kernels. A typical RBF kernel, e.g. Gaussian function, naturally penalizes long-distance response and is only activated by neighboring points. Such localized response generates highly discriminative features given different point distributions. In addition, our framework allows the joint optimization of kernel distribution and its receptive field, automatically evolving kernel configurations in an end-to-end manner. We demonstrate that the proposed network with a single RBF layer can outperform the state-of-the-art Pointnet++ in terms of classification accuracy for 3D object recognition tasks. Moreover, the introduction of nonlinear mappings significantly reduces the number of network parameters and computational cost, enabling significantly faster training and a deployable point cloud recognition solution on portable devices with limited resources.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes