CVIVMay 7, 2020

What comprises a good talking-head video generation?: A Survey and Benchmark

arXiv:2005.03201v172 citationsHas Code
AI Analysis

This work addresses the problem of subjective and unreproducible evaluations for researchers in computer vision, though it is incremental as it builds on existing methods by providing a benchmark.

The authors tackled the lack of standardized evaluation in talking-head video generation by creating a benchmark with new or selected metrics for identity preservation, lip sync, video quality, and natural motion, analyzing state-of-the-art methods to reveal their strengths and weaknesses.

Over the years, performance evaluation has become essential in computer vision, enabling tangible progress in many sub-fields. While talking-head video generation has become an emerging research topic, existing evaluations on this topic present many limitations. For example, most approaches use human subjects (e.g., via Amazon MTurk) to evaluate their research claims directly. This subjective evaluation is cumbersome, unreproducible, and may impend the evolution of new research. In this work, we present a carefully-designed benchmark for evaluating talking-head video generation with standardized dataset pre-processing strategies. As for evaluation, we either propose new metrics or select the most appropriate ones to evaluate results in what we consider as desired properties for a good talking-head video, namely, identity preserving, lip synchronization, high video quality, and natural-spontaneous motion. By conducting a thoughtful analysis across several state-of-the-art talking-head generation approaches, we aim to uncover the merits and drawbacks of current methods and point out promising directions for future work. All the evaluation code is available at: https://github.com/lelechen63/talking-head-generation-survey.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes