SceneFoundry: Generating Interactive Infinite 3D Worlds

arXiv:2601.05810v2, 3 citations, h-index: 5
Originality: Highly original
AI Analysis

The paper addresses the need for physically realistic, functionally complex 3D environments for robotic training, and offers a novel method for a known bottleneck in generative approaches.

The paper tackles the problem of generating large-scale, interactive 3D environments for robotic learning by introducing SceneFoundry, a language-guided diffusion framework that produces apartment-scale worlds with articulated furniture and diverse layouts, enabling scalable embodied AI research.

The ability to automatically generate large-scale, interactive, and physically realistic 3D environments is crucial for advancing robotic learning and embodied intelligence. However, existing generative approaches often fail to capture the functional complexity of real-world interiors, particularly those containing articulated objects with movable parts essential for manipulation and navigation. This paper presents SceneFoundry, a language-guided diffusion framework that generates apartment-scale 3D worlds with functionally articulated furniture and semantically diverse layouts for robotic training. From natural language prompts, an LLM module controls floor layout generation, while diffusion-based posterior sampling efficiently populates the scene with articulated assets from large-scale 3D repositories. To ensure physical usability, SceneFoundry employs differentiable guidance functions to regulate object quantity, prevent articulation collisions, and maintain sufficient walkable space for robotic navigation. Extensive experiments demonstrate that our framework generates structurally valid, semantically coherent, and functionally interactive environments across diverse scene types and conditions, enabling scalable embodied AI research. Project page: https://anc891203.github.io/SceneFoundry-Demo/
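
To make the idea of a differentiable guidance function concrete, here is a minimal sketch of how a collision penalty could steer object placements by gradient descent. It is an illustration under stated assumptions, not SceneFoundry's implementation: the names collision_penalty and guided_refine are hypothetical, furniture footprints are approximated as 2D circles, and the diffusion denoiser is omitted so that only the guidance step is shown.

```python
import numpy as np

def collision_penalty(centers, radii):
    """Hypothetical guidance term: sum of squared overlaps between circular footprints.

    Returns the penalty and its analytic gradient with respect to the centers.
    """
    diff = centers[:, None, :] - centers[None, :, :]        # (N, N, 2), diff[i, j] = c_i - c_j
    dist = np.linalg.norm(diff, axis=-1) + 1e-9             # small epsilon avoids division by zero
    overlap = np.maximum(0.0, radii[:, None] + radii[None, :] - dist)
    np.fill_diagonal(overlap, 0.0)                          # an object does not collide with itself
    penalty = 0.5 * np.sum(overlap ** 2)                    # each pair appears twice in the matrix
    # d(penalty)/d(c_i) = -2 * sum_j overlap_ij * (c_i - c_j) / dist_ij
    grad = -2.0 * np.sum((overlap / dist)[..., None] * diff, axis=1)
    return penalty, grad

def guided_refine(centers, radii, step=0.1, n_steps=50):
    """Assumed stand-in for guided sampling: repeatedly step against the penalty gradient."""
    x = centers.copy()
    for _ in range(n_steps):
        _, grad = collision_penalty(x, radii)
        x = x - step * grad                                  # push overlapping objects apart
    return x

# Two overlapping objects of radius 0.5 placed 0.5 apart.
centers = np.array([[0.0, 0.0], [0.5, 0.0]])
radii = np.array([0.5, 0.5])
before, _ = collision_penalty(centers, radii)
refined = guided_refine(centers, radii)
after, _ = collision_penalty(refined, radii)
```

In an actual guided diffusion sampler, a gradient like this would typically be combined with the denoising update at each step rather than applied on its own. Analogous penalties could in principle be written for object count or walkable area, though the abstract does not specify their form.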
