CVGRROJan 9, 2023

Locomotion-Action-Manipulation: Synthesizing Human-Scene Interactions in Complex 3D Environments

arXiv:2301.02667v255 citationsh-index: 23
AI Analysis

This work addresses the challenge of generating diverse human-scene interactions for applications in animation, virtual reality, or robotics, representing an incremental improvement over existing methods.

The paper tackles the problem of synthesizing natural and plausible long-term human movements in complex indoor environments, presenting LAMA as a unified framework that outperforms previous approaches in realistic motion synthesis across various challenging scenarios.

Synthesizing interaction-involved human motions has been challenging due to the high complexity of 3D environments and the diversity of possible human behaviors within. We present LAMA, Locomotion-Action-MAnipulation, to synthesize natural and plausible long-term human movements in complex indoor environments. The key motivation of LAMA is to build a unified framework to encompass a series of everyday motions including locomotion, scene interaction, and object manipulation. Unlike existing methods that require motion data "paired" with scanned 3D scenes for supervision, we formulate the problem as a test-time optimization by using human motion capture data only for synthesis. LAMA leverages a reinforcement learning framework coupled with a motion matching algorithm for optimization, and further exploits a motion editing framework via manifold learning to cover possible variations in interaction and manipulation. Throughout extensive experiments, we demonstrate that LAMA outperforms previous approaches in synthesizing realistic motions in various challenging scenarios. Project page: https://jiyewise.github.io/projects/LAMA/ .

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes