CVAIJul 15, 2023

Bidirectionally Deformable Motion Modulation For Video-based Human Pose Transfer

arXiv:2307.07754v230 citationsh-index: 37Has Code
Originality Incremental advance
AI Analysis

This work improves video generation for animating human images with new poses, offering incremental advancements in handling structural patterns and motion consistency.

The paper tackles video-based human pose transfer by addressing issues like distorted textures and flickering artifacts, proposing a Deformable Motion Modulation method that outperforms state-of-the-art techniques in image fidelity and visual continuity.

Video-based human pose transfer is a video-to-video generation task that animates a plain source human image based on a series of target human poses. Considering the difficulties in transferring highly structural patterns on the garments and discontinuous poses, existing methods often generate unsatisfactory results such as distorted textures and flickering artifacts. To address these issues, we propose a novel Deformable Motion Modulation (DMM) that utilizes geometric kernel offset with adaptive weight modulation to simultaneously perform feature alignment and style transfer. Different from normal style modulation used in style transfer, the proposed modulation mechanism adaptively reconstructs smoothed frames from style codes according to the object shape through an irregular receptive field of view. To enhance the spatio-temporal consistency, we leverage bidirectional propagation to extract the hidden motion information from a warped image sequence generated by noisy poses. The proposed feature propagation significantly enhances the motion prediction ability by forward and backward propagation. Both quantitative and qualitative experimental results demonstrate superiority over the state-of-the-arts in terms of image fidelity and visual continuity. The source code is publicly available at github.com/rocketappslab/bdmm.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes