CVFeb 16, 2024

PointMamba: A Simple State Space Model for Point Cloud Analysis

arXiv:2402.10739v5291 citationsh-index: 21Has CodeNIPS
Originality Incremental advance
AI Analysis

This work addresses computational bottlenecks in 3D vision tasks for researchers and practitioners, offering an incremental improvement by adapting an existing method to a new domain.

The paper tackles the problem of high computational complexity in point cloud analysis by proposing PointMamba, a state space model with linear complexity, which achieves superior performance on multiple datasets while significantly reducing GPU memory usage and FLOPs.

Transformers have become one of the foundational architectures in point cloud analysis tasks due to their excellent global modeling ability. However, the attention mechanism has quadratic complexity, making the design of a linear complexity method with global modeling appealing. In this paper, we propose PointMamba, transferring the success of Mamba, a recent representative state space model (SSM), from NLP to point cloud analysis tasks. Unlike traditional Transformers, PointMamba employs a linear complexity algorithm, presenting global modeling capacity while significantly reducing computational costs. Specifically, our method leverages space-filling curves for effective point tokenization and adopts an extremely simple, non-hierarchical Mamba encoder as the backbone. Comprehensive evaluations demonstrate that PointMamba achieves superior performance across multiple datasets while significantly reducing GPU memory usage and FLOPs. This work underscores the potential of SSMs in 3D vision-related tasks and presents a simple yet effective Mamba-based baseline for future research. The code will be made available at \url{https://github.com/LMD0311/PointMamba}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes