CVGRApr 20, 2023

Reconstructing Signing Avatars From Video Using Linguistic Priors

ETH Zurich
arXiv:2304.10482v119 citationsh-index: 139
Originality Highly original
AI Analysis

This work addresses a critical need for Deaf people by improving access to technology and online media through better 3D avatar generation for sign language learning and AR/VR applications.

The paper tackles the problem of reconstructing expressive 3D avatars from sign language video by introducing linguistic priors to resolve ambiguities, resulting in SGNify outperforming state-of-the-art methods and producing avatars that are as comprehensible and natural as source videos.

Sign language (SL) is the primary method of communication for the 70 million Deaf people around the world. Video dictionaries of isolated signs are a core SL learning tool. Replacing these with 3D avatars can aid learning and enable AR/VR applications, improving access to technology and online media. However, little work has attempted to estimate expressive 3D avatars from SL video; occlusion, noise, and motion blur make this task difficult. We address this by introducing novel linguistic priors that are universally applicable to SL and provide constraints on 3D hand pose that help resolve ambiguities within isolated signs. Our method, SGNify, captures fine-grained hand pose, facial expression, and body movement fully automatically from in-the-wild monocular SL videos. We evaluate SGNify quantitatively by using a commercial motion-capture system to compute 3D avatars synchronized with monocular video. SGNify outperforms state-of-the-art 3D body-pose- and shape-estimation methods on SL videos. A perceptual study shows that SGNify's 3D reconstructions are significantly more comprehensible and natural than those of previous methods and are on par with the source videos. Code and data are available at $\href{http://sgnify.is.tue.mpg.de}{\text{sgnify.is.tue.mpg.de}}$.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes