CVApr 2, 2024

Towards Robust 3D Pose Transfer with Adversarial Learning

arXiv:2404.02242v16 citationsh-index: 5CVPR
Originality Highly original
AI Analysis

This work addresses the problem of cumbersome pre-processing for real-time 3D pose transfer applications, offering a more robust solution for handling noisy real-world data.

The paper tackles the challenge of 3D pose transfer by enhancing model robustness to noisy inputs, such as raw point clouds, using adversarial learning and a novel 3D-PoseMAE method, resulting in much better quality transferred meshes and strong generalizability across various poses and domains.

3D pose transfer that aims to transfer the desired pose to a target mesh is one of the most challenging 3D generation tasks. Previous attempts rely on well-defined parametric human models or skeletal joints as driving pose sources. However, to obtain those clean pose sources, cumbersome but necessary pre-processing pipelines are inevitable, hindering implementations of the real-time applications. This work is driven by the intuition that the robustness of the model can be enhanced by introducing adversarial samples into the training, leading to a more invulnerable model to the noisy inputs, which even can be further extended to directly handling the real-world data like raw point clouds/scans without intermediate processing. Furthermore, we propose a novel 3D pose Masked Autoencoder (3D-PoseMAE), a customized MAE that effectively learns 3D extrinsic presentations (i.e., pose). 3D-PoseMAE facilitates learning from the aspect of extrinsic attributes by simultaneously generating adversarial samples that perturb the model and learning the arbitrary raw noisy poses via a multi-scale masking strategy. Both qualitative and quantitative studies show that the transferred meshes given by our network result in much better quality. Besides, we demonstrate the strong generalizability of our method on various poses, different domains, and even raw scans. Experimental results also show meaningful insights that the intermediate adversarial samples generated in the training can successfully attack the existing pose transfer models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes