CVJan 15, 2023

Diffusion-based Generation, Optimization, and Planning in 3D Scenes

Peking U
arXiv:2301.06015v1323 citationsh-index: 32
Originality Incremental advance
AI Analysis

This addresses the need for integrated scene-aware models in robotics and computer vision, though it appears to be an incremental advancement combining diffusion models with existing 3D understanding tasks.

The paper tackles the problem of 3D scene understanding by introducing SceneDiffuser, a unified diffusion-based model for scene-conditioned generation, optimization, and planning. The results show significant improvements over previous models across tasks like human motion generation, grasp generation, and robot motion planning.

We introduce SceneDiffuser, a conditional generative model for 3D scene understanding. SceneDiffuser provides a unified model for solving scene-conditioned generation, optimization, and planning. In contrast to prior works, SceneDiffuser is intrinsically scene-aware, physics-based, and goal-oriented. With an iterative sampling strategy, SceneDiffuser jointly formulates the scene-aware generation, physics-based optimization, and goal-oriented planning via a diffusion-based denoising process in a fully differentiable fashion. Such a design alleviates the discrepancies among different modules and the posterior collapse of previous scene-conditioned generative models. We evaluate SceneDiffuser with various 3D scene understanding tasks, including human pose and motion generation, dexterous grasp generation, path planning for 3D navigation, and motion planning for robot arms. The results show significant improvements compared with previous models, demonstrating the tremendous potential of SceneDiffuser for the broad community of 3D scene understanding.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes