CVMar 29, 2023

Point2Vec for Self-Supervised Representation Learning on Point Clouds

arXiv:2303.16570v253 citationsh-index: 5
Originality Incremental advance
AI Analysis

This work addresses the challenge of self-supervised representation learning for 3D point clouds, which is important for computer vision and robotics applications, though it appears incremental as it adapts an existing framework to a new domain.

The authors tackled the problem of applying self-supervised learning to 3D point clouds by extending the data2vec framework, discovering that positional information leakage limits representation quality and proposing point2vec to address this. Their method outperformed other self-supervised approaches on shape classification and few-shot learning benchmarks like ModelNet40 and ScanObjectNN, achieving competitive results on part segmentation.

Recently, the self-supervised learning framework data2vec has shown inspiring performance for various modalities using a masked student-teacher approach. However, it remains open whether such a framework generalizes to the unique challenges of 3D point clouds. To answer this question, we extend data2vec to the point cloud domain and report encouraging results on several downstream tasks. In an in-depth analysis, we discover that the leakage of positional information reveals the overall object shape to the student even under heavy masking and thus hampers data2vec to learn strong representations for point clouds. We address this 3D-specific shortcoming by proposing point2vec, which unleashes the full potential of data2vec-like pre-training on point clouds. Our experiments show that point2vec outperforms other self-supervised methods on shape classification and few-shot learning on ModelNet40 and ScanObjectNN, while achieving competitive results on part segmentation on ShapeNetParts. These results suggest that the learned representations are strong and transferable, highlighting point2vec as a promising direction for self-supervised learning of point cloud representations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes