Rotation Transformation Network: Learning View-Invariant Point Cloud for Classification and Segmentation
This addresses the challenge of view-invariant processing for 3D point clouds in computer vision, offering an incremental enhancement to existing deep neural networks.
The paper tackled the problem of 3D point cloud analysis being sensitive to object rotations by proposing a rotation transformation network (RTN) that reduces rotational degrees of freedom to zero, leading to significant performance improvements in classification and segmentation tasks.
Many recent works show that a spatial manipulation module could boost the performances of deep neural networks (DNNs) for 3D point cloud analysis. In this paper, we aim to provide an insight into spatial manipulation modules. Firstly, we find that the smaller the rotational degree of freedom (RDF) of objects is, the more easily these objects are handled by these DNNs. Then, we investigate the effect of the popular T-Net module and find that it could not reduce the RDF of objects. Motivated by the above two issues, we propose a rotation transformation network for point cloud analysis, called RTN, which could reduce the RDF of input 3D objects to 0. The RTN could be seamlessly inserted into many existing DNNs for point cloud analysis. Extensive experimental results on 3D point cloud classification and segmentation tasks demonstrate that the proposed RTN could improve the performances of several state-of-the-art methods significantly.