CVAIGRFeb 21, 2025

WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents

arXiv:2502.15601v219 citationsh-index: 11
Originality Incremental advance
AI Analysis

This system democratizes 3D world creation for non-professionals by automating tasks that usually need skilled labor, though it builds incrementally on existing LLM and procedural generation techniques.

The paper tackles the problem of creating photorealistic 3D virtual worlds, which typically requires professional expertise, by introducing WorldCraft, a system that uses LLM agents to generate and customize scenes via natural language commands, enabling non-professionals to design complex indoor and outdoor environments.

Constructing photorealistic virtual worlds has applications across various fields, but it often requires the extensive labor of highly trained professionals to operate conventional 3D modeling software. To democratize this process, we introduce WorldCraft, a system where large language model (LLM) agents leverage procedural generation to create indoor and outdoor scenes populated with objects, allowing users to control individual object attributes and the scene layout using intuitive natural language commands. In our framework, a coordinator agent manages the overall process and works with two specialized LLM agents to complete the scene creation: ForgeIt, which integrates an ever-growing manual through auto-verification to enable precise customization of individual objects, and ArrangeIt, which formulates hierarchical optimization problems to achieve a layout that balances ergonomic and aesthetic considerations. Additionally, our pipeline incorporates a trajectory control agent, allowing users to animate the scene and operate the camera through natural language interactions. Our system is also compatible with off-the-shelf deep 3D generators to enrich scene assets. Through evaluations and comparisons with state-of-the-art methods, we demonstrate the versatility of WorldCraft, ranging from single-object customization to intricate, large-scale interior and exterior scene designs. This system empowers non-professionals to bring their creative visions to life.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes