CVAIMar 6

Text-Driven Emotionally Continuous Talking Face Generation

arXiv:2603.06071v1h-index: 2
Predicted impact top 45% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This addresses the limitation of previous methods in producing natural, continuously changing expressions for applications like virtual avatars or entertainment, though it is incremental by focusing on emotional dynamics.

The paper tackles the problem of generating talking face videos with fixed emotions by proposing Emotionally Continuous Talking Face Generation (EC-TFG), which uses text and varying emotion descriptions to create videos with smooth emotional transitions, achieving high-quality visuals and motion authenticity.

Talking Face Generation (TFG) strives to create realistic and emotionally expressive digital faces. While previous TFG works have mastered the creation of naturalistic facial movements, they typically express a fixed target emotion in synthetic videos and lack the ability to exhibit continuously changing and natural expressions like humans do when conveying information. To synthesize realistic videos, we propose a novel task called Emotionally Continuous Talking Face Generation (EC-TFG), which takes a text segment and an emotion description with varying emotions as driving data, aiming to generate a video where the person speaks the text while reflecting the emotional changes within the description. Alongside this, we introduce a customized model, i.e., Temporal-Intensive Emotion Modulated Talking Face Generation (TIE-TFG), which innovatively manages dynamic emotional variations by employing Temporal-Intensive Emotion Fluctuation Modeling, allowing it to provide emotion variation sequences corresponding to the input text to drive continuous facial expression changes in synthesized videos. Extensive evaluations demonstrate our method's exceptional ability to produce smooth emotion transitions and uphold high-quality visuals and motion authenticity across diverse emotional states.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes