CVAINov 9, 2024

PointCG: Self-supervised Point Cloud Learning via Joint Completion and Generation

arXiv:2411.06041v14 citationsh-index: 19Has CodeIEEE Trans Vis Comput Graph
Originality Incremental advance
AI Analysis

This work addresses the challenge of ambiguous supervision and geometric insensitivity in self-supervised point cloud learning for 3D computer vision applications, representing an incremental advancement.

The paper tackles the problem of self-supervised point cloud learning by integrating masked point modeling and 3D-to-2D generation into a pre-training framework called PointCG, which improves encoder perception of 3D objects and shows superiority over baselines in downstream tasks.

The core of self-supervised point cloud learning lies in setting up appropriate pretext tasks, to construct a pre-training framework that enables the encoder to perceive 3D objects effectively. In this paper, we integrate two prevalent methods, masked point modeling (MPM) and 3D-to-2D generation, as pretext tasks within a pre-training framework. We leverage the spatial awareness and precise supervision offered by these two methods to address their respective limitations: ambiguous supervision signals and insensitivity to geometric information. Specifically, the proposed framework, abbreviated as PointCG, consists of a Hidden Point Completion (HPC) module and an Arbitrary-view Image Generation (AIG) module. We first capture visible points from arbitrary views as inputs by removing hidden points. Then, HPC extracts representations of the inputs with an encoder and completes the entire shape with a decoder, while AIG is used to generate rendered images based on the visible points' representations. Extensive experiments demonstrate the superiority of the proposed method over the baselines in various downstream tasks. Our code will be made available upon acceptance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes