CVJan 16, 2024

Efficient4D: Fast Dynamic 3D Object Generation from a Single-view Video

arXiv:2401.08742v452 citationsInt J Comput Vis
Originality Incremental advance
AI Analysis

This addresses the challenge of efficient 4D content creation for applications like virtual reality or animation, though it appears incremental as it builds on existing pipelines with optimizations.

The paper tackles the problem of generating dynamic 3D objects from single-view videos by proposing Efficient4D, which achieves a 10-fold speed increase over prior methods while preserving quality, taking only 10 minutes versus 120 minutes for a comparable task.

Generating dynamic 3D object from a single-view video is challenging due to the lack of 4D labeled data. An intuitive approach is to extend previous image-to-3D pipelines by transferring off-the-shelf image generation models such as score distillation sampling.However, this approach would be slow and expensive to scale due to the need for back-propagating the information-limited supervision signals through a large pretrained model. To address this, we propose an efficient video-to-4D object generation framework called Efficient4D. It generates high-quality spacetime-consistent images under different camera views, and then uses them as labeled data to directly reconstruct the 4D content through a 4D Gaussian splatting model. Importantly, our method can achieve real-time rendering under continuous camera trajectories. To enable robust reconstruction under sparse views, we introduce inconsistency-aware confidence-weighted loss design, along with a lightly weighted score distillation loss. Extensive experiments on both synthetic and real videos show that Efficient4D offers a remarkable 10-fold increase in speed when compared to prior art alternatives while preserving the quality of novel view synthesis. For example, Efficient4D takes only 10 minutes to model a dynamic object, vs 120 minutes by the previous art model Consistent4D.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes