SphereNet: Learning a Noise-Robust and General Descriptor for Point Cloud Registration
This addresses the challenge of noise susceptibility and poor generalization in learning-based point cloud registration, which is crucial for applications like robotics and 3D scanning, and represents a strong specific gain rather than an incremental improvement.
The paper tackles the problem of point cloud registration by introducing SphereNet, a descriptor that is robust to noise and generalizes well to unseen datasets, achieving a 25 percentage point increase in feature matching recall under high-intensity noise and setting new state-of-the-art performance on benchmarks with 93.5% and 75.6% registration recall.
Point cloud registration is to estimate a transformation to align point clouds collected in different perspectives. In learning-based point cloud registration, a robust descriptor is vital for high-accuracy registration. However, most methods are susceptible to noise and have poor generalization ability on unseen datasets. Motivated by this, we introduce SphereNet to learn a noise-robust and unseen-general descriptor for point cloud registration. In our method, first, the spheroid generator builds a geometric domain based on spherical voxelization to encode initial features. Then, the spherical interpolation of the sphere is introduced to realize robustness against noise. Finally, a new spherical convolutional neural network with spherical integrity padding completes the extraction of descriptors, which reduces the loss of features and fully captures the geometric features. To evaluate our methods, a new benchmark 3DMatch-noise with strong noise is introduced. Extensive experiments are carried out on both indoor and outdoor datasets. Under high-intensity noise, SphereNet increases the feature matching recall by more than 25 percentage points on 3DMatch-noise. In addition, it sets a new state-of-the-art performance for the 3DMatch and 3DLoMatch benchmarks with 93.5\% and 75.6\% registration recall and also has the best generalization ability on unseen datasets.