CVDec 18, 2020

Classification of Single-View Object Point Clouds

arXiv:2012.10042v410 citations
AI Analysis

This work tackles a practical problem of classifying partial object point clouds for computer vision and robotics applications, where occlusions are common, by highlighting the necessity of pose estimation.

This paper addresses the problem of classifying single-view, partial object point clouds, a common scenario in practice due to occlusions. The authors demonstrate that existing point cloud classifiers experience a drastic performance drop in this setting and propose PAPNet, a baseline method that integrates supervised object pose estimation with classification, achieving superior results compared to existing methods.

Object point cloud classification has drawn great research attention since the release of benchmarking datasets, such as the ModelNet and the ShapeNet. These benchmarks assume point clouds covering complete surfaces of object instances, for which plenty of high-performing methods have been developed. However, their settings deviate from those often met in practice, where, due to (self-)occlusion, a point cloud covering partial surface of an object is captured from an arbitrary view. We show in this paper that performance of existing point cloud classifiers drops drastically under the considered single-view, partial setting; the phenomenon is consistent with the observation that semantic category of a partial object surface is less ambiguous only when its distribution on the whole surface is clearly specified. To this end, we argue for a single-view, partial setting where supervised learning of object pose estimation should be accompanied with classification. Technically, we propose a baseline method of Pose-Accompanied Point cloud classification Network (PAPNet); built upon SE(3)-equivariant convolutions, the PAPNet learns intermediate pose transformations for equivariant features defined on vector fields, which makes the subsequent classification easier (ideally) in the category-level, canonical pose. By adapting existing ModelNet40 and ScanNet datasets to the single-view, partial setting, experiment results can verify the necessity of object pose estimation and superiority of our PAPNet to existing classifiers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes