OrbiSim: World Models as Differentiable Physics Engines for Embodied Intelligence
This work provides a novel differentiable simulation paradigm for embodied AI, enabling gradient-based optimization in robotics tasks that were previously intractable with classical simulators.
OrbiSim introduces a fully differentiable physics engine for robotic simulation, enabling end-to-end gradient flow from state transitions to visual observations. It outperforms state-of-the-art world models in predictive fidelity and control performance, particularly for tasks like differentiable contact modeling and gradient-based policy optimization under sparse rewards.
We present OrbiSim, a novel robotic simulation paradigm that redefines world models as a fully differentiable physics engine for embodied intelligence. Unlike prior world models that focus on unconstrained imagination in latent or visual domains, OrbiSim establishes a unified, physically-grounded pathway that bridges structured scene assets, neural dynamics, and downstream reinforcement learning. By enabling end-to-end differentiability throughout the entire simulation loop -- spanning from explicit state transitions to visual observation generation -- OrbiSim supports tasks traditionally intractable for classical simulators, such as differentiable contact modeling, gradient-based policy optimization under sparse rewards, and intuitive physical inference. Empirical results demonstrate that OrbiSim significantly outperforms state-of-the-art world models in both predictive fidelity and control performance. Furthermore, its consistent responsiveness to asset configurations and physical parameters suggests its potential as a differentiable tool for enhancing robot simulation and policy training.