CVFeb 12

Code2Worlds: Empowering Coding LLMs for 4D World Generation

arXiv:2602.11757v15 citationsh-index: 1Has Code
Originality Incremental advance
AI Analysis

This work advances spatial intelligence for AI applications like robotics and simulation by enabling physics-aware dynamic world generation, though it builds incrementally on existing coding LLM methods.

The paper tackles the problem of generating 4D dynamic worlds from language descriptions by addressing challenges like multi-scale context entanglement and semantic-physical execution gaps, achieving a 41% SGS gain and 49% higher Richness on the Code4D benchmark.

Achieving spatial intelligence requires moving beyond visual plausibility to build world simulators grounded in physical laws. While coding LLMs have advanced static 3D scene generation, extending this paradigm to 4D dynamics remains a critical frontier. This task presents two fundamental challenges: multi-scale context entanglement, where monolithic generation fails to balance local object structures with global environmental layouts; and a semantic-physical execution gap, where open-loop code generation leads to physical hallucinations lacking dynamic fidelity. We introduce Code2Worlds, a framework that formulates 4D generation as language-to-simulation code generation. First, we propose a dual-stream architecture that disentangles retrieval-augmented object generation from hierarchical environmental orchestration. Second, to ensure dynamic fidelity, we establish a physics-aware closed-loop mechanism in which a PostProcess Agent scripts dynamics, coupled with a VLM-Motion Critic that performs self-reflection to iteratively refine simulation code. Evaluations on the Code4D benchmark show Code2Worlds outperforms baselines with a 41% SGS gain and 49% higher Richness, while uniquely generating physics-aware dynamics absent in prior static methods. Code: https://github.com/AIGeeksGroup/Code2Worlds. Website: https://aigeeksgroup.github.io/Code2Worlds.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes