HCAICVGRSep 19, 2019

A High-Fidelity Open Embodied Avatar with Lip Syncing and Expression Capabilities

arXiv:1909.08766v240 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This work provides an open-source tool for developers and researchers to create virtual agents with non-verbal social cues, though it is incremental as it builds upon existing engines and APIs.

The authors tackled the problem of creating a high-fidelity open embodied avatar with lip syncing and expression capabilities, resulting in a system built on Unreal Engine that can be controlled via a Python interface and includes features like phoneme control, head gestures, and facial expressions, with code and models released publicly.

Embodied avatars as virtual agents have many applications and provide benefits over disembodied agents, allowing non-verbal social and interactional cues to be leveraged, in a similar manner to how humans interact with each other. We present an open embodied avatar built upon the Unreal Engine that can be controlled via a simple python programming interface. The avatar has lip syncing (phoneme control), head gesture and facial expression (using either facial action units or cardinal emotion categories) capabilities. We release code and models to illustrate how the avatar can be controlled like a puppet or used to create a simple conversational agent using public application programming interfaces (APIs). GITHUB link: https://github.com/danmcduff/AvatarSim

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes