CVAILGRODec 6, 2024

Extrapolated Urban View Synthesis Benchmark

arXiv:2412.05256v310 citationsh-index: 32
Originality Synthesis-oriented
AI Analysis

This addresses the need for more robust view synthesis in autonomous vehicle simulation, though it is incremental as it focuses on evaluation rather than new methods.

The authors identified that current novel view synthesis methods for autonomous vehicle simulation are evaluated primarily on interpolated views rather than extrapolated views, which limits progress in generalizable simulation technology. They created the first Extrapolated Urban View Synthesis benchmark and found that state-of-the-art methods overfit to training views and cannot fundamentally improve under large view changes, even with diffusion priors or better geometry.

Photorealistic simulators are essential for the training and evaluation of vision-centric autonomous vehicles (AVs). At their core is Novel View Synthesis (NVS), a crucial capability that generates diverse unseen viewpoints to accommodate the broad and continuous pose distribution of AVs. Recent advances in radiance fields, such as 3D Gaussian Splatting, achieve photorealistic rendering at real-time speeds and have been widely used in modeling large-scale driving scenes. However, their performance is commonly evaluated using an interpolated setup with highly correlated training and test views. In contrast, extrapolation, where test views largely deviate from training views, remains underexplored, limiting progress in generalizable simulation technology. To address this gap, we leverage publicly available AV datasets with multiple traversals, multiple vehicles, and multiple cameras to build the first Extrapolated Urban View Synthesis (EUVS) benchmark. Meanwhile, we conduct both quantitative and qualitative evaluations of state-of-the-art NVS methods across different evaluation settings. Our results show that current NVS methods are prone to overfitting to training views. Besides, incorporating diffusion priors and improving geometry cannot fundamentally improve NVS under large view changes, highlighting the need for more robust approaches and large-scale training. We will release the data to help advance self-driving and urban robotics simulation technology.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes