CV | RO | Aug 23, 2025

DeltaFlow: An Efficient Multi-frame Scene Flow Estimation Method

arXiv:2508.17054v2 | 6 citations | h-index: 54 | Has Code
Originality: Incremental advance
AI Analysis

This work addresses computational inefficiency and accuracy challenges in scene flow estimation for autonomous driving applications, representing a strong specific gain rather than a foundational breakthrough.

The paper tackles inefficient multi-frame scene flow estimation by proposing DeltaFlow, a lightweight 3D framework that cuts computational cost while improving accuracy, achieving up to 22% lower error and 2x faster inference than the next-best multi-frame supervised method.

Previous dominant methods for scene flow estimation focus mainly on input from two consecutive frames, neglecting valuable information in the temporal domain. While recent trends shift towards multi-frame reasoning, they suffer from rapidly escalating computational costs as the number of frames grows. To leverage temporal information more efficiently, we propose DeltaFlow ($Δ$Flow), a lightweight 3D framework that captures motion cues via a $Δ$ scheme, extracting temporal features with minimal computational cost, regardless of the number of frames. Additionally, scene flow estimation faces challenges such as imbalanced object class distributions and motion inconsistency. To tackle these issues, we introduce a Category-Balanced Loss to enhance learning across underrepresented classes and an Instance Consistency Loss to enforce coherent object motion, improving flow accuracy. Extensive evaluations on the Argoverse 2, Waymo and nuScenes datasets show that $Δ$Flow achieves state-of-the-art performance with up to 22% lower error and $2\times$ faster inference compared to the next-best multi-frame supervised method, while also demonstrating a strong cross-domain generalization ability. The code is open-sourced at https://github.com/Kin-Zhang/DeltaFlow along with trained model weights.
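The abstract names the two losses but does not give their formulas. As a rough illustration of the ideas only, the sketch below shows one generic way to weight classes equally and to penalize inconsistent motion within an object. The function names, the per-class averaging, and the mean-flow deviation penalty are all assumptions made for illustration; they are not taken from the paper or its repository.

```python
from collections import defaultdict


def category_balanced_error(errors, labels):
    """Average the per-class mean errors so every class counts equally.

    Assumption: illustrative only, not the paper's Category-Balanced Loss.
    """
    per_class = defaultdict(list)
    for err, lab in zip(errors, labels):
        per_class[lab].append(err)
    class_means = [sum(v) / len(v) for v in per_class.values()]
    return sum(class_means) / len(class_means)


def instance_consistency_penalty(flows, instance_ids):
    """Mean squared deviation of each point's flow from its instance's mean flow.

    Assumption: illustrative only, not the paper's Instance Consistency Loss.
    """
    groups = defaultdict(list)
    for flow, inst in zip(flows, instance_ids):
        groups[inst].append(flow)
    total, count = 0.0, 0
    for group in groups.values():
        dim = len(group[0])
        # Mean flow vector of all points belonging to this instance.
        mean = [sum(f[d] for f in group) / len(group) for d in range(dim)]
        for f in group:
            total += sum((f[d] - mean[d]) ** 2 for d in range(dim))
            count += 1
    return total / count


# Hypothetical toy data: three well-predicted car points, one poorly predicted pedestrian point.
errors = [0.1, 0.1, 0.1, 0.9]
labels = ["car", "car", "car", "pedestrian"]
balanced = category_balanced_error(errors, labels)

# Two points on the same object that disagree on their motion.
penalty = instance_consistency_penalty([(0.0, 0.0, 0.0), (2.0, 0.0, 0.0)], [1, 1])
```

In the toy data, a plain average over points would be dominated by the three car points, whereas the per-class average gives the single pedestrian point the same weight as the whole car class. The consistency penalty is zero whenever all points of an instance share the same flow vector.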

