CVJan 15

RSATalker: Realistic Socially-Aware Talking Head Generation for Multi-Turn Conversation

arXiv:2601.10606v1h-index: 9
Originality Incremental advance
AI Analysis

This addresses the problem of creating socially-aware virtual avatars for VR applications, though it is incremental by building on existing 3DGS methods.

The paper tackles generating realistic talking heads for multi-turn conversations by introducing RSATalker, a framework that uses 3D Gaussian Splatting to render high-fidelity avatars and incorporates social relationships, achieving state-of-the-art performance in realism and social awareness.

Talking head generation is increasingly important in virtual reality (VR), especially for social scenarios involving multi-turn conversation. Existing approaches face notable limitations: mesh-based 3D methods can model dual-person dialogue but lack realistic textures, while large-model-based 2D methods produce natural appearances but incur prohibitive computational costs. Recently, 3D Gaussian Splatting (3DGS) based methods achieve efficient and realistic rendering but remain speaker-only and ignore social relationships. We introduce RSATalker, the first framework that leverages 3DGS for realistic and socially-aware talking head generation with support for multi-turn conversation. Our method first drives mesh-based 3D facial motion from speech, then binds 3D Gaussians to mesh facets to render high-fidelity 2D avatar videos. To capture interpersonal dynamics, we propose a socially-aware module that encodes social relationships, including blood and non-blood as well as equal and unequal, into high-level embeddings through a learnable query mechanism. We design a three-stage training paradigm and construct the RSATalker dataset with speech-mesh-image triplets annotated with social relationships. Extensive experiments demonstrate that RSATalker achieves state-of-the-art performance in both realism and social awareness. The code and dataset will be released.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes