CVAug 17, 2023

Realistic Full-Body Tracking from Sparse Observations via Joint-Level Modeling

arXiv:2308.08855v162 citationsh-index: 30
Originality Incremental advance
AI Analysis

This addresses the challenge of full-body tracking in VR/AR with limited hardware, though it appears incremental as it builds on existing motion data and methods.

The paper tackles the problem of realistically driving 3D full-body avatars from sparse head and hand tracking signals for VR/AR applications, achieving more accurate and smooth motion compared to existing approaches.

To bridge the physical and virtual worlds for rapidly developed VR/AR applications, the ability to realistically drive 3D full-body avatars is of great significance. Although real-time body tracking with only the head-mounted displays (HMDs) and hand controllers is heavily under-constrained, a carefully designed end-to-end neural network is of great potential to solve the problem by learning from large-scale motion data. To this end, we propose a two-stage framework that can obtain accurate and smooth full-body motions with the three tracking signals of head and hands only. Our framework explicitly models the joint-level features in the first stage and utilizes them as spatiotemporal tokens for alternating spatial and temporal transformer blocks to capture joint-level correlations in the second stage. Furthermore, we design a set of loss terms to constrain the task of a high degree of freedom, such that we can exploit the potential of our joint-level modeling. With extensive experiments on the AMASS motion dataset and real-captured data, we validate the effectiveness of our designs and show our proposed method can achieve more accurate and smooth motion compared to existing approaches.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes