LG CVMay 22, 2024

Learning rigid-body simulators over implicit shapes for large-scale scenes and vision

Yulia Rubanova, Tatiana Lopez-Guevara, Kelsey R. Allen, William F. Whitney, Kimberly Stachenfeld, Tobias Pfaff

DeepMind

arXiv:2405.14045v114.213 citationsh-index: 13NIPS

Originality Incremental advance

AI Analysis

This addresses a bottleneck in robotics, engineering, and entertainment by enabling efficient simulation of large-scale scenes, though it is an incremental improvement over existing learned simulators.

The paper tackles the problem of scaling learned rigid-body simulators to large scenes with many objects by introducing SDF-Sim, which uses learned signed-distance functions to represent shapes and speed up distance computation, enabling simulation of scenes with hundreds of objects and up to 1.1 million nodes where mesh-based approaches fail.

Simulating large scenes with many rigid objects is crucial for a variety of applications, such as robotics, engineering, film and video games. Rigid interactions are notoriously hard to model: small changes to the initial state or the simulation parameters can lead to large changes in the final state. Recently, learned simulators based on graph networks (GNNs) were developed as an alternative to hand-designed simulators like MuJoCo and PyBullet. They are able to accurately capture dynamics of real objects directly from real-world observations. However, current state-of-the-art learned simulators operate on meshes and scale poorly to scenes with many objects or detailed shapes. Here we present SDF-Sim, the first learned rigid-body simulator designed for scale. We use learned signed-distance functions (SDFs) to represent the object shapes and to speed up distance computation. We design the simulator to leverage SDFs and avoid the fundamental bottleneck of the previous simulators associated with collision detection. For the first time in literature, we demonstrate that we can scale the GNN-based simulators to scenes with hundreds of objects and up to 1.1 million nodes, where mesh-based approaches run out of memory. Finally, we show that SDF-Sim can be applied to real world scenes by extracting SDFs from multi-view images.

View on arXiv PDF

Similar