CVJan 29, 2025

SSF: Sparse Long-Range Scene Flow for Autonomous Driving

arXiv:2501.17821v18 citationsh-index: 54Has CodeICRA
Originality Incremental advance
AI Analysis

This addresses a scalability bottleneck in 3D motion estimation for autonomous vehicles, though it appears incremental as it builds on sparse convolution backbones.

The paper tackles the problem of estimating scene flow at long ranges for autonomous driving, where existing methods struggle due to computational scaling issues, and achieves state-of-the-art results on the Argoverse2 dataset.

Scene flow enables an understanding of the motion characteristics of the environment in the 3D world. It gains particular significance in the long-range, where object-based perception methods might fail due to sparse observations far away. Although significant advancements have been made in scene flow pipelines to handle large-scale point clouds, a gap remains in scalability with respect to long-range. We attribute this limitation to the common design choice of using dense feature grids, which scale quadratically with range. In this paper, we propose Sparse Scene Flow (SSF), a general pipeline for long-range scene flow, adopting a sparse convolution based backbone for feature extraction. This approach introduces a new challenge: a mismatch in size and ordering of sparse feature maps between time-sequential point scans. To address this, we propose a sparse feature fusion scheme, that augments the feature maps with virtual voxels at missing locations. Additionally, we propose a range-wise metric that implicitly gives greater importance to faraway points. Our method, SSF, achieves state-of-the-art results on the Argoverse2 dataset, demonstrating strong performance in long-range scene flow estimation. Our code will be released at https://github.com/KTH-RPL/SSF.git.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes