CVMay 7, 2025

GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model

arXiv:2505.04119v36 citationsh-index: 6Has CodeICML
Originality Highly original
AI Analysis

This work addresses the computational and storage inefficiencies in adapting 3D vision models for downstream tasks, offering a more efficient solution for researchers and practitioners in 3D computer vision.

The paper tackles the challenge of efficiently fine-tuning pre-trained 3D vision models for point cloud tasks by proposing GAPrompt, a geometry-aware prompting method that uses only 2.19% of trainable parameters and outperforms state-of-the-art parameter-efficient fine-tuning approaches while achieving competitive results compared to full fine-tuning.

Pre-trained 3D vision models have gained significant attention for their promising performance on point cloud data. However, fully fine-tuning these models for downstream tasks is computationally expensive and storage-intensive. Existing parameter-efficient fine-tuning (PEFT) approaches, which focus primarily on input token prompting, struggle to achieve competitive performance due to their limited ability to capture the geometric information inherent in point clouds. To address this challenge, we propose a novel Geometry-Aware Point Cloud Prompt (GAPrompt) that leverages geometric cues to enhance the adaptability of 3D vision models. First, we introduce a Point Prompt that serves as an auxiliary input alongside the original point cloud, explicitly guiding the model to capture fine-grained geometric details. Additionally, we present a Point Shift Prompter designed to extract global shape information from the point cloud, enabling instance-specific geometric adjustments at the input level. Moreover, our proposed Prompt Propagation mechanism incorporates the shape information into the model's feature extraction process, further strengthening its ability to capture essential geometric characteristics. Extensive experiments demonstrate that GAPrompt significantly outperforms state-of-the-art PEFT methods and achieves competitive results compared to full fine-tuning on various benchmarks, while utilizing only 2.19% of trainable parameters. Our code is available at https://github.com/zhoujiahuan1991/ICML2025-GAPrompt.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes