CVSep 23, 2024

MIMAFace: Face Animation via Motion-Identity Modulated Appearance Feature Learning

arXiv:2409.15179v12 citationsh-index: 14
Originality Incremental advance
AI Analysis

This work addresses the challenge of temporal consistency and image quality in face animation for video generation applications, representing an incremental improvement over existing diffusion-based methods.

The paper tackled the problem of generating temporally stable and high-quality face animation videos from public datasets by introducing a Motion-Identity Modulated Appearance Learning Module and an Inter-clip Affinity Learning Module, achieving precise motion control and faithful identity preservation.

Current diffusion-based face animation methods generally adopt a ReferenceNet (a copy of U-Net) and a large amount of curated self-acquired data to learn appearance features, as robust appearance features are vital for ensuring temporal stability. However, when trained on public datasets, the results often exhibit a noticeable performance gap in image quality and temporal consistency. To address this issue, we meticulously examine the essential appearance features in the facial animation tasks, which include motion-agnostic (e.g., clothing, background) and motion-related (e.g., facial details) texture components, along with high-level discriminative identity features. Drawing from this analysis, we introduce a Motion-Identity Modulated Appearance Learning Module (MIA) that modulates CLIP features at both motion and identity levels. Additionally, to tackle the semantic/ color discontinuities between clips, we design an Inter-clip Affinity Learning Module (ICA) to model temporal relationships across clips. Our method achieves precise facial motion control (i.e., expressions and gaze), faithful identity preservation, and generates animation videos that maintain both intra/inter-clip temporal consistency. Moreover, it easily adapts to various modalities of driving sources. Extensive experiments demonstrate the superiority of our method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes