CVFeb 15, 2025

SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers

arXiv:2502.10841v137 citationsh-index: 18
Originality Incremental advance
AI Analysis

This work improves portrait animation for applications like virtual avatars and digital media, but it appears incremental as it builds on existing video diffusion Transformer methods.

The paper tackles the problem of portrait image animation in video diffusion Transformers, addressing issues like identity distortion and background instability, and achieves visually coherent and compositionally diverse results through expression-aware conditioning and multi-stage training.

We present SkyReels-A1, a simple yet effective framework built upon video diffusion Transformer to facilitate portrait image animation. Existing methodologies still encounter issues, including identity distortion, background instability, and unrealistic facial dynamics, particularly in head-only animation scenarios. Besides, extending to accommodate diverse body proportions usually leads to visual inconsistencies or unnatural articulations. To address these challenges, SkyReels-A1 capitalizes on the strong generative capabilities of video DiT, enhancing facial motion transfer precision, identity retention, and temporal coherence. The system incorporates an expression-aware conditioning module that enables seamless video synthesis driven by expression-guided landmark inputs. Integrating the facial image-text alignment module strengthens the fusion of facial attributes with motion trajectories, reinforcing identity preservation. Additionally, SkyReels-A1 incorporates a multi-stage training paradigm to incrementally refine the correlation between expressions and motion while ensuring stable identity reproduction. Extensive empirical evaluations highlight the model's ability to produce visually coherent and compositionally diverse results, making it highly applicable to domains such as virtual avatars, remote communication, and digital media generation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes