CVMar 20

StreetForward: Perceiving Dynamic Street with Feedforward Causal Attention

arXiv:2603.1955252.01 citationsh-index: 4
AI Analysis

This addresses the need for efficient scene reconstruction in autonomous driving applications, though it appears incremental as it builds on existing attention mechanisms.

The authors tackled dynamic street reconstruction for autonomous driving by proposing StreetForward, a feedforward framework that eliminates per-scene optimization, achieving superior performance in novel view synthesis and depth estimation on the Waymo Open Dataset.

Feedforward reconstruction is crucial for autonomous driving applications, where rapid scene reconstruction enables efficient utilization of large-scale driving datasets in closed-loop simulation and other downstream tasks, eliminating the need for time-consuming per-scene optimization. We present StreetForward, a pose-free and tracker-free feedforward framework for dynamic street reconstruction. Building upon the alternating attention mechanism from Visual Geometry Grounded Transformer (VGGT), we propose a simple yet effective temporal mask attention module that captures dynamic motion information from image sequences and produces motion-aware latent representations. Static content and dynamic instances are represented uniformly with 3D Gaussian Splatting, and are optimized jointly by cross-frame rendering with spatio-temporal consistency, allowing the model to infer per-pixel velocities and produce high-fidelity novel views at new poses and times. We train and evaluate our model on the Waymo Open Dataset, demonstrating superior performance on novel view synthesis and depth estimation compared to existing methods. Furthermore, zero-shot inference on CARLA and other datasets validates the generalization capability of our approach. More visualizations are available on our project page: https://streetforward.github.io.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes