CV · Oct 11, 2024

Foundation Model-Powered 3D Few-Shot Class Incremental Learning via Training-free Adaptor

arXiv:2410.09237v1 · 3 citations · h-index: 11 · Has Code · ACCV
Originality: Highly original
AI Analysis

This work addresses incremental learning challenges in 3D computer vision, offering a versatile solution for scenarios where only a few samples of each new class are available.

The paper tackles the Few-Shot Class Incremental Learning (FSCIL) problem in 3D point cloud environments by pairing a 3D foundation model with a training-free adaptor, and reports that it outperforms other FSCIL methods on datasets such as ModelNet, ShapeNet, ScanObjectNN, and CO3D.

Recent advances in deep learning for point cloud processing have increased interest in Few-Shot Class Incremental Learning (FSCIL) for 3D computer vision. This paper introduces a new method to tackle the FSCIL problem in 3D point cloud environments. We leverage a foundational 3D model trained extensively on point cloud data. Drawing on recent improvements in foundation models, which are known to generalize well across different tasks, we propose a novel strategy that requires no additional training to adapt to new tasks. Our approach uses a dual cache system: first, it stores previous test samples, selected by how confident the model was in its predictions, to prevent forgetting; second, it includes a small number of new task samples to prevent overfitting. This dynamic adaptation ensures strong performance across different learning tasks without extensive fine-tuning. We tested our approach on datasets like ModelNet, ShapeNet, ScanObjectNN, and CO3D, showing that it outperforms other FSCIL methods and demonstrating its effectiveness and versatility. The code is available at https://github.com/ahmadisahar/ACCV_FCIL3D.
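The dual cache idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation (see the linked repository for that); it is a generic cache-based, training-free adaptor written under stated assumptions. The class name `DualCache`, the parameters `conf_threshold`, `beta`, and `alpha`, and the exponential affinity used to turn similarities into cache logits are all hypothetical choices made for illustration. Features and base logits are assumed to come from a frozen 3D foundation model, which is not included here.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale vectors along the last axis to unit length."""
    x = np.asarray(x, dtype=float)
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


class DualCache:
    """Illustrative two-part cache: few-shot support samples plus
    confidently predicted test samples. No parameters are trained."""

    def __init__(self, num_classes, conf_threshold=0.9, beta=5.0, alpha=1.0):
        self.num_classes = num_classes
        self.conf_threshold = conf_threshold  # assumed confidence cut-off
        self.beta = beta                      # assumed affinity sharpness
        self.alpha = alpha                    # assumed weight of cache logits
        self.keys = {"support": [], "test": []}
        self.labels = {"support": [], "test": []}

    def add_support(self, feats, labels):
        """Store the few labelled samples of a new task."""
        for f, y in zip(l2_normalize(feats), labels):
            self.keys["support"].append(f)
            self.labels["support"].append(int(y))

    def maybe_add_test(self, feat, probs):
        """Store a test sample under its predicted label if the model
        was confident enough; return whether it was stored."""
        probs = np.asarray(probs, dtype=float)
        if float(probs.max()) >= self.conf_threshold:
            self.keys["test"].append(l2_normalize(feat))
            self.labels["test"].append(int(probs.argmax()))
            return True
        return False

    def cache_logits(self, feat):
        """Similarity-weighted vote of cached labels for one query."""
        q = l2_normalize(feat)
        logits = np.zeros(self.num_classes)
        for name in ("support", "test"):
            if not self.keys[name]:
                continue
            keys = np.stack(self.keys[name])               # (n, d)
            affinity = np.exp(-self.beta * (1.0 - keys @ q))  # (n,)
            onehot = np.eye(self.num_classes)[self.labels[name]]  # (n, C)
            logits += affinity @ onehot
        return self.alpha * logits

    def predict(self, feat, base_logits):
        """Combine frozen-model logits with the cache logits."""
        return np.asarray(base_logits, dtype=float) + self.cache_logits(feat)


if __name__ == "__main__":
    cache = DualCache(num_classes=2)
    cache.add_support(np.array([[1.0, 0.0], [0.0, 1.0]]), [0, 1])
    scores = cache.predict(np.array([0.9, 0.1]), base_logits=np.zeros(2))
    print(int(np.argmax(scores)))
```

In this sketch the support entries carry information about newly introduced classes, while the confidence-filtered test entries keep evidence about previously seen classes available at inference time, which mirrors the two roles the abstract assigns to the caches.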

Code Implementations: 1 repo
