CVOct 13, 2024

Point Cloud Mixture-of-Domain-Experts Model for 3D Self-supervised Learning

arXiv:2410.09886v36 citationsh-index: 14IJCAI
Originality Incremental advance
AI Analysis

This addresses the issue of domain-specific limitations in point cloud SSL for 3D computer vision applications, representing an incremental improvement.

The paper tackles the problem of limited 3D representation learning in point cloud self-supervised learning by proposing a mixture-of-domain-experts model with a block-to-scene pre-training strategy, achieving superior performance in downstream tasks.

Point clouds, as a primary representation of 3D data, can be categorized into scene domain point clouds and object domain point clouds. Point cloud self-supervised learning (SSL) has become a mainstream paradigm for learning 3D representations. However, existing point cloud SSL primarily focuses on learning domain-specific 3D representations within a single domain, neglecting the complementary nature of cross-domain knowledge, which limits the learning of 3D representations. In this paper, we propose to learn a comprehensive Point cloud Mixture-of-Domain-Experts model (Point-MoDE) via a block-to-scene pre-training strategy. Specifically, we first propose a mixture-of-domain-expert model consisting of scene domain experts and multiple shared object domain experts. Furthermore, we propose a block-to-scene pretraining strategy, which leverages the features of point blocks in the object domain to regress their initial positions in the scene domain through object-level block mask reconstruction and scene-level block position regression. By integrating the complementary knowledge between object and scene, this strategy simultaneously facilitates the learning of both object-domain and scene-domain representations, leading to a more comprehensive 3D representation. Extensive experiments in downstream tasks demonstrate the superiority of our model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes