CVNov 24, 2021

Human Pose Manipulation and Novel View Synthesis using Differentiable Rendering

arXiv:2111.12731v2Has Code
Originality Highly original
AI Analysis

This enables motion transfer and view synthesis from single-camera inputs, advancing computer vision applications in human-centric tasks.

The paper tackles the problem of synthesizing novel views and poses of humans by introducing a differentiable renderer using diffuse Gaussian primitives, achieving highly realistic image synthesis on Human3.6M and Panoptic Studio datasets.

We present a new approach for synthesizing novel views of people in new poses. Our novel differentiable renderer enables the synthesis of highly realistic images from any viewpoint. Rather than operating over mesh-based structures, our renderer makes use of diffuse Gaussian primitives that directly represent the underlying skeletal structure of a human. Rendering these primitives gives results in a high-dimensional latent image, which is then transformed into an RGB image by a decoder network. The formulation gives rise to a fully differentiable framework that can be trained end-to-end. We demonstrate the effectiveness of our approach to image reconstruction on both the Human3.6M and Panoptic Studio datasets. We show how our approach can be used for motion transfer between individuals; novel view synthesis of individuals captured from just a single camera; to synthesize individuals from any virtual viewpoint; and to re-render people in novel poses. Code and video results are available at https://github.com/GuillaumeRochette/HumanViewSynthesis.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes