CVJun 21, 2022

Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh Recovery

Stanford
arXiv:2206.10457v213 citationsh-index: 55
Originality Incremental advance
AI Analysis

This addresses the problem of distribution shifts in human mesh recovery for applications in entertainment, robotics, and healthcare, but it is incremental as it builds on existing synthetic and domain adaptation methods.

The paper tackles the challenge of generalizing human mesh recovery models to real-world scenarios by proposing Domain Adaptive 3D Pose Augmentation (DAPA), a data augmentation method that improves results on benchmarks like 3DPW and AGORA.

The ability to perceive 3D human bodies from a single image has a multitude of applications ranging from entertainment and robotics to neuroscience and healthcare. A fundamental challenge in human mesh recovery is in collecting the ground truth 3D mesh targets required for training, which requires burdensome motion capturing systems and is often limited to indoor laboratories. As a result, while progress is made on benchmark datasets collected in these restrictive settings, models fail to generalize to real-world "in-the-wild" scenarios due to distribution shifts. We propose Domain Adaptive 3D Pose Augmentation (DAPA), a data augmentation method that enhances the model's generalization ability in in-the-wild scenarios. DAPA combines the strength of methods based on synthetic datasets by getting direct supervision from the synthesized meshes, and domain adaptation methods by using ground truth 2D keypoints from the target dataset. We show quantitatively that finetuning with DAPA effectively improves results on benchmarks 3DPW and AGORA. We further demonstrate the utility of DAPA on a challenging dataset curated from videos of real-world parent-child interaction.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes