CVDec 16, 2022

GFPose: Learning 3D Human Pose Prior with Gradient Fields

Hai Ci, Mingdong Wu, Wentao Zhu, Xiaoxuan Ma, Hao Dong, Fangwei Zhong, Yizhou Wang

arXiv:2212.08641v123.495 citationsh-index: 20Has Code

Originality Highly original

AI Analysis

This work addresses the need for versatile 3D human pose modeling in AI applications, representing an incremental advance with strong specific gains.

The paper tackles the problem of learning 3D human pose priors for human-centered AI by introducing GFPose, a framework that uses a time-dependent score network to denoise perturbed poses, achieving a 20% improvement over existing SOTAs on the Human3.6M dataset as a multi-hypothesis estimator.

Learning 3D human pose prior is essential to human-centered AI. Here, we present GFPose, a versatile framework to model plausible 3D human poses for various applications. At the core of GFPose is a time-dependent score network, which estimates the gradient on each body joint and progressively denoises the perturbed 3D human pose to match a given task specification. During the denoising process, GFPose implicitly incorporates pose priors in gradients and unifies various discriminative and generative tasks in an elegant framework. Despite the simplicity, GFPose demonstrates great potential in several downstream tasks. Our experiments empirically show that 1) as a multi-hypothesis pose estimator, GFPose outperforms existing SOTAs by 20% on Human3.6M dataset. 2) as a single-hypothesis pose estimator, GFPose achieves comparable results to deterministic SOTAs, even with a vanilla backbone. 3) GFPose is able to produce diverse and realistic samples in pose denoising, completion and generation tasks. Project page https://sites.google.com/view/gfpose/

View on arXiv PDF Code

Similar