CVMay 6

PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World

arXiv:2605.0516352.5
Predicted impact top 7% in CV · last 90 daysOriginality Incremental advance
AI Analysis

It addresses the bottleneck of generating interactive 3D assets with physical properties, crucial for virtual worlds and embodied AI.

PhysForge generates physics-grounded 3D assets with functional properties for interactive virtual worlds, using a two-stage framework with a VLM planner and a diffusion model, achieving simulation-ready assets with precise kinematic parameters.

Synthesizing physics-grounded 3D assets is a critical bottleneck for interactive virtual worlds and embodied AI. Existing methods predominantly focus on static geometry, overlooking the functional properties essential for interaction. We propose that interactive asset generation must be rooted in functional logic and hierarchical physics. To bridge this gap, we introduce PhysForge, a decoupled two-stage framework supported by PhysDB, a large-scale dataset of 150,000 assets with four-tier physical annotations. First, a VLM acts as a "physical architect" to plan a "Hierarchical Physical Blueprint" defining material, functional, and kinematic constraints. Second, a physics-grounded diffusion model realizes this blueprint by synthesizing high-fidelity geometry alongside precise kinematic parameters via a novel KineVoxel Injection (KVI) mechanism. Experiments demonstrate that PhysForge produces functionally plausible, simulation-ready assets, providing a robust data engine for interactive 3D content and embodied agents.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes