CVLGMay 1, 2019

Automatic Dataset Augmentation Using Virtual Human Simulation

arXiv:1905.00261v11 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of manual annotation and data scarcity in pedestrian detection for machine learning applications, though it is incremental as it builds on existing virtual simulation techniques.

The paper tackled the problem of needing large labeled datasets for pedestrian detection by using virtual human simulation to automatically generate training data, and found that augmenting real image sequences with computer graphics images improved detection results.

Virtual Human Simulation has been widely used for different purposes, such as comfort or accessibility analysis. In this paper, we investigate the possibility of using this type of technique to extend the training datasets of pedestrians to be used with machine learning techniques. Our main goal is to verify if Computer Graphics (CG) images of virtual humans with a simplistic rendering can be efficient in order to augment datasets used for training machine learning methods. In fact, from a machine learning point of view, there is a need to collect and label large datasets for ground truth, which sometimes demands manual annotation. In addition, find out images and videos with real people and also provide ground truth of people detection and counting is not trivial. If CG images, which can have a ground truth automatically generated, can also be used as training in machine learning techniques for pedestrian detection and counting, it can certainly facilitate and optimize the whole process of event detection. In particular, we propose to parametrize virtual humans using a data-driven approach. Results demonstrated that using the extended datasets with CG images outperforms the results when compared to only real images sequences.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes