CVDec 30, 2025

DriveExplorer: Images-Only Decoupled 4D Reconstruction with Progressive Restoration for Driving View Extrapolation

Yuang Jia, Jinlong Wang, Jiayi Zhao, Chunlam Li, Shunzhou Wang, Wei Gao

arXiv:2512.23983v13.6h-index: 1

Originality Incremental advance

AI Analysis

This addresses the problem of costly sensor dependencies for autonomous driving view extrapolation, though it appears incremental as it builds on existing 4D Gaussian and diffusion model techniques.

The paper tackles view extrapolation in autonomous driving by developing a method that uses only images and optional camera poses to reconstruct scenes, eliminating the need for expensive sensors or annotations. The approach achieves higher-quality images at novel viewpoints compared to baselines.

This paper presents an effective solution for view extrapolation in autonomous driving scenarios. Recent approaches focus on generating shifted novel view images from given viewpoints using diffusion models. However, these methods heavily rely on priors such as LiDAR point clouds, 3D bounding boxes, and lane annotations, which demand expensive sensors or labor-intensive labeling, limiting applicability in real-world deployment. In this work, with only images and optional camera poses, we first estimate a global static point cloud and per-frame dynamic point clouds, fusing them into a unified representation. We then employ a deformable 4D Gaussian framework to reconstruct the scene. The initially trained 4D Gaussian model renders degraded and pseudo-images to train a video diffusion model. Subsequently, progressively shifted Gaussian renderings are iteratively refined by the diffusion model,and the enhanced results are incorporated back as training data for 4DGS. This process continues until extrapolation reaches the target viewpoints. Compared with baselines, our method produces higher-quality images at novel extrapolated viewpoints.

View on arXiv PDF

Similar