CVGRROJan 7

Choreographing a World of Dynamic Objects

arXiv:2601.04194v11 citationsh-index: 13
Originality Incremental advance
AI Analysis

This addresses the challenge of scalable and category-agnostic dynamic scene generation for applications like robotics and graphics, though it builds on existing video generative models.

The paper tackles the problem of generating dynamic 4D scenes with evolving and interacting objects by introducing CHORD, a universal generative pipeline that synthesizes such phenomena, demonstrating effectiveness through experiments on multi-body dynamics and robotics manipulation policies.

Dynamic objects in our physical 4D (3D + time) world are constantly evolving, deforming, and interacting with other objects, leading to diverse 4D scene dynamics. In this paper, we present a universal generative pipeline, CHORD, for CHOReographing Dynamic objects and scenes and synthesizing this type of phenomena. Traditional rule-based graphics pipelines to create these dynamics are based on category-specific heuristics, yet are labor-intensive and not scalable. Recent learning-based methods typically demand large-scale datasets, which may not cover all object categories in interest. Our approach instead inherits the universality from the video generative models by proposing a distillation-based pipeline to extract the rich Lagrangian motion information hidden in the Eulerian representations of 2D videos. Our method is universal, versatile, and category-agnostic. We demonstrate its effectiveness by conducting experiments to generate a diverse range of multi-body 4D dynamics, show its advantage compared to existing methods, and demonstrate its applicability in generating robotics manipulation policies. Project page: https://yanzhelyu.github.io/chord

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes