CV · AI · Apr 13, 2022

Neural Texture Extraction and Distribution for Controllable Person Image Synthesis

arXiv:2204.06160v1 · 95 citations · h-index: 28 · Has Code
Originality: Incremental advance
AI Analysis

This paper addresses the problem of generating realistic person images with explicit control over pose and appearance, with applications in graphics and AI. It is rated as an incremental advance.

The paper tackles controllable person image synthesis by re-rendering a human from a reference image with control over body pose and appearance, and reports superior results in experimental comparisons.

We deal with the controllable person image synthesis task, which aims to re-render a human from a reference image with explicit control over body pose and appearance. Observing that person images are highly structured, we propose to generate desired images by extracting and distributing semantic entities of reference images. To achieve this goal, a neural texture extraction and distribution operation based on double attention is described. This operation first extracts semantic neural textures from reference feature maps. Then, it distributes the extracted neural textures according to the spatial distributions learned from target poses. Our model is trained to predict human images in arbitrary poses, which encourages it to extract disentangled and expressive neural textures representing the appearance of different semantic entities. The disentangled representation further enables explicit appearance control. Neural textures of different reference images can be fused to control the appearance of the areas of interest. Experimental comparisons show the superiority of the proposed model. Code is available at https://github.com/RenYurui/Neural-Texture-Extraction-Distribution.
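
The extract-then-distribute idea in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation (see the linked repository for that); it only shows one plausible reading of "double attention": a first softmax over spatial positions gathers one texture vector per semantic entity, and a second softmax over entities scatters those textures onto target positions. All names here (`extract_filters`, `distribute_filters`, `fuse_textures`, the sizes `K` and `C`) are hypothetical, and random arrays stand in for learned weights and feature maps.

```python
import numpy as np


def softmax(x, axis):
    """Numerically stable softmax along one axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)


def extract_textures(ref_feats, extract_filters):
    """Gather K texture vectors from a flattened reference feature map.

    ref_feats:       (N_ref, C) reference features, N_ref = H * W
    extract_filters: (K, C) hypothetical filters, one per semantic entity
    returns:         (K, C) one "neural texture" per entity
    """
    logits = extract_filters @ ref_feats.T   # (K, N_ref)
    attn = softmax(logits, axis=1)           # each entity attends over positions
    return attn @ ref_feats                  # weighted average of features


def distribute_textures(textures, pose_feats, distribute_filters):
    """Scatter K textures onto target positions derived from the target pose.

    pose_feats:         (N_tgt, C) features computed from the target pose
    distribute_filters: (K, C) hypothetical filters, one per semantic entity
    returns:            (N_tgt, C) features for the target image
    """
    logits = pose_feats @ distribute_filters.T  # (N_tgt, K)
    attn = softmax(logits, axis=1)              # each position attends over entities
    return attn @ textures


def fuse_textures(tex_a, tex_b, take_from_b):
    """Swap selected entities of reference A for those of reference B."""
    mask = np.asarray(take_from_b, dtype=bool)[:, None]  # (K, 1)
    return np.where(mask, tex_b, tex_a)


# Toy data: random stand-ins for encoder outputs and learned filters.
rng = np.random.default_rng(0)
K, C, n_ref, n_tgt = 4, 8, 16 * 16, 16 * 16
ref_a = rng.normal(size=(n_ref, C))
ref_b = rng.normal(size=(n_ref, C))
pose = rng.normal(size=(n_tgt, C))
w_ext = rng.normal(size=(K, C))
w_dist = rng.normal(size=(K, C))

tex_a = extract_textures(ref_a, w_ext)
tex_b = extract_textures(ref_b, w_ext)
fused = fuse_textures(tex_a, tex_b, [False, True, False, False])
out = distribute_textures(fused, pose, w_dist)
print(tex_a.shape, out.shape)
```

Because the textures form a small set of `K` vectors that is independent of spatial layout, swapping one row of that set changes the appearance of a single entity without touching the pose, which is the kind of explicit appearance control the abstract describes.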

Code Implementations: 1 repo