ROCVGRNov 29, 2024

One-Shot Real-to-Sim via End-to-End Differentiable Simulation and Rendering

arXiv:2412.00259v45 citationsh-index: 2IEEE Robot Autom Lett
Originality Highly original
AI Analysis

This addresses the challenge for robotics in enabling task planning and execution in new settings, representing a novel method rather than an incremental improvement.

The paper tackles the problem of learning predictive world models for robots in novel environments from sparse online observations, achieving joint optimization of geometry, appearance, and physical properties from only one robot action sequence.

Identifying predictive world models for robots in novel environments from sparse online observations is essential for robot task planning and execution in novel environments. However, existing methods that leverage differentiable programming to identify world models are incapable of jointly optimizing the geometry, appearance, and physical properties of the scene. In this work, we introduce a novel rigid object representation that allows the joint identification of these properties. Our method employs a novel differentiable point-based geometry representation coupled with a grid-based appearance field, which allows differentiable object collision detection and rendering. Combined with a differentiable physical simulator, we achieve end-to-end optimization of world models, given the sparse visual and tactile observations of a physical motion sequence. Through a series of world model identification tasks in simulated and real environments, we show that our method can learn both simulation- and rendering-ready world models from only one robot action sequence. The code and additional videos are available at our project website: https://tianyi20.github.io/rigid-world-model.github.io/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes