CVAIGRMay 21, 2021

Driving-Signal Aware Full-Body Avatars

arXiv:2105.10441v2114 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of virtual telepresence with minimal sensors, though it appears incremental as it builds on existing conditional variational autoencoder methods.

The authors tackled the problem of animating full-body avatars with incomplete driving signals like pose and facial keypoints, achieving a high-quality representation of human geometry and view-dependent appearance by disentangling signals and imputing missing factors.

We present a learning-based method for building driving-signal aware full-body avatars. Our model is a conditional variational autoencoder that can be animated with incomplete driving signals, such as human pose and facial keypoints, and produces a high-quality representation of human geometry and view-dependent appearance. The core intuition behind our method is that better drivability and generalization can be achieved by disentangling the driving signals and remaining generative factors, which are not available during animation. To this end, we explicitly account for information deficiency in the driving signal by introducing a latent space that exclusively captures the remaining information, thus enabling the imputation of the missing factors required during full-body animation, while remaining faithful to the driving signal. We also propose a learnable localized compression for the driving signal which promotes better generalization, and helps minimize the influence of global chance-correlations often found in real datasets. For a given driving signal, the resulting variational model produces a compact space of uncertainty for missing factors that allows for an imputation strategy best suited to a particular application. We demonstrate the efficacy of our approach on the challenging problem of full-body animation for virtual telepresence with driving signals acquired from minimal sensors placed in the environment and mounted on a VR-headset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes