CVGRJan 3, 2023

Procedural Humans for Computer Vision

arXiv:2301.01161v15 citationsh-index: 33
Originality Incremental advance
AI Analysis

This work addresses the domain-gap and realism issues in human-centered synthetic data for computer vision applications like autonomous driving, though it is incremental as it builds on prior methods.

The authors tackled the challenge of generating realistic full-body synthetic human data for computer vision by extending an existing pipeline to create images with ground-truth annotations, resulting in a method for training deep neural networks to regress dense body landmarks and fit a parametric model to these predictions.

Recent work has shown the benefits of synthetic data for use in computer vision, with applications ranging from autonomous driving to face landmark detection and reconstruction. There are a number of benefits of using synthetic data from privacy preservation and bias elimination to quality and feasibility of annotation. Generating human-centered synthetic data is a particular challenge in terms of realism and domain-gap, though recent work has shown that effective machine learning models can be trained using synthetic face data alone. We show that this can be extended to include the full body by building on the pipeline of Wood et al. to generate synthetic images of humans in their entirety, with ground-truth annotations for computer vision applications. In this report we describe how we construct a parametric model of the face and body, including articulated hands; our rendering pipeline to generate realistic images of humans based on this body model; an approach for training DNNs to regress a dense set of landmarks covering the entire body; and a method for fitting our body model to dense landmarks predicted from multiple views.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes