CVAIFeb 24, 2025

MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation

arXiv:2502.16907v17 citationsh-index: 7Has Code
Originality Incremental advance
AI Analysis

It addresses a domain-specific problem in autonomous driving for accurate real-time flow estimation, with incremental improvements over existing methods.

The paper tackles scene flow estimation for 3D motion prediction in autonomous driving by proposing MambaFlow, a network that uses a Mamba-based decoder and scene-adaptive loss, achieving state-of-the-art performance on the Argoverse 2 benchmark with real-time inference speed.

Scene flow estimation aims to predict 3D motion from consecutive point cloud frames, which is of great interest in autonomous driving field. Existing methods face challenges such as insufficient spatio-temporal modeling and inherent loss of fine-grained feature during voxelization. However, the success of Mamba, a representative state space model (SSM) that enables global modeling with linear complexity, provides a promising solution. In this paper, we propose MambaFlow, a novel scene flow estimation network with a mamba-based decoder. It enables deep interaction and coupling of spatio-temporal features using a well-designed backbone. Innovatively, we steer the global attention modeling of voxel-based features with point offset information using an efficient Mamba-based decoder, learning voxel-to-point patterns that are used to devoxelize shared voxel representations into point-wise features. To further enhance the model's generalization capabilities across diverse scenarios, we propose a novel scene-adaptive loss function that automatically adapts to different motion patterns.Extensive experiments on the Argoverse 2 benchmark demonstrate that MambaFlow achieves state-of-the-art performance with real-time inference speed among existing works, enabling accurate flow estimation in real-world urban scenarios. The code is available at https://github.com/SCNU-RISLAB/MambaFlow.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes