CV · Jul 5, 2025

Voyaging into Perpetual Dynamic Scenes from a Single View

arXiv:2507.04183v2 · 3 citations · h-index: 8
Originality: Incremental advance
AI Analysis

This work addresses a key challenge for augmented/virtual reality and robotics applications by enabling perpetual scene generation from limited input. It appears incremental, however, because it builds on prior outpainting and 3D methods.

The paper addresses the generation of perpetual dynamic scenes from a single view. It proposes DynamicVoyager, which reformulates the task as scene outpainting with 3D motion consistency. The reported results show generated scenes with consistent motion along fly-through camera trajectories.

Generating a perpetual dynamic scene from a single view is an important problem with widespread applications in augmented and virtual reality and in robotics. However, since dynamic scenes change over time, a key challenge is to ensure that the different generated views are consistent with the underlying 3D motions. Prior work learns such consistency by training on multiple views, but the generated scene regions often interpolate between training views, so these methods fail to generate perpetual views. To address this issue, we propose DynamicVoyager, which reformulates dynamic scene generation as a scene outpainting problem with new dynamic content. As 2D outpainting models struggle to generate 3D-consistent motions from a single 2D view, we enrich 2D pixels with information from their 3D rays, which facilitates learning 3D motion consistency. More specifically, we first map the single-view video input to a dynamic point cloud using the estimated video depths. We then render a partial video of the point cloud from a novel view and outpaint the missing regions using ray information (e.g., the distance from a ray to the point cloud) to generate 3D-consistent motions. Next, we use the outpainted video to update the point cloud, which is then used to outpaint the scene from future novel views. Moreover, we can control the generated content with an input text prompt. Experiments show that our model can generate perpetual scenes with consistent motions along fly-through cameras. Project page: https://tianfr.github.io/DynamicVoyager.
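The geometric part of the pipeline described in the abstract (lift a frame to a point cloud from its depth, re-render it from a novel view, find the pixels left empty, and compute a ray-to-cloud distance) can be illustrated with a minimal sketch for a single frame. This is not the authors' implementation. It assumes a pinhole camera, a translation-only novel view, and nearest-pixel splatting, and every name here (`unproject`, `render_partial`, `missing_mask`, `ray_to_cloud_distance`, and the intrinsics `fx`, `fy`, `cx`, `cy`) is hypothetical. The exact definition of the ray distance used in the paper is not given in the abstract, so the one below is only one plausible reading.

```python
import math

# Illustrative sketch only: pinhole camera, translation-only novel view.
# All function and parameter names are hypothetical, not from DynamicVoyager.


def unproject(depth, fx, fy, cx, cy):
    """Lift an H x W depth map to 3D points in the source camera frame."""
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            points.append(((u - cx) * z / fx, (v - cy) * z / fy, z))
    return points


def render_partial(points, shift, fx, fy, cx, cy, height, width):
    """Project points into a camera translated by `shift`.

    Returns a depth buffer in which None marks pixels that no point
    reaches, i.e. the region an outpainting model would have to fill.
    """
    zbuf = [[None] * width for _ in range(height)]
    tx, ty, tz = shift
    for x, y, z in points:
        x, y, z = x - tx, y - ty, z - tz
        if z <= 0:  # point is behind the novel camera
            continue
        u = round(fx * x / z + cx)
        v = round(fy * y / z + cy)
        if 0 <= u < width and 0 <= v < height:
            # Keep the nearest point per pixel (simple z-buffer).
            if zbuf[v][u] is None or z < zbuf[v][u]:
                zbuf[v][u] = z
    return zbuf


def missing_mask(zbuf):
    """True where the partial render has no content."""
    return [[z is None for z in row] for row in zbuf]


def ray_to_cloud_distance(origin, direction, points):
    """Smallest distance from a ray to any point of the cloud (assumed definition)."""
    norm = math.sqrt(sum(c * c for c in direction))
    d = [c / norm for c in direction]
    best = math.inf
    for p in points:
        rel = [p[i] - origin[i] for i in range(3)]
        t = max(0.0, sum(rel[i] * d[i] for i in range(3)))  # clamp to the ray start
        best = min(best, math.sqrt(sum((rel[i] - t * d[i]) ** 2 for i in range(3))))
    return best
```

For a video, the same steps would be repeated per frame. Moving the camera to the right shifts the rendered content to the left, so the empty pixels appear along the right edge of the partial render. In the method as described, those are the pixels the outpainting stage fills before the point cloud is updated.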

