CVAug 8, 2018

A Semi-Supervised Data Augmentation Approach using 3D Graphical Engines

arXiv:1808.02595v215 citations
AI Analysis

This addresses the small data problem for applications like personalized human pose and behavior estimation, though it is incremental as it builds on existing data augmentation and domain adaptation methods.

The paper tackles the problem of limited labeled training data for human pose estimation by proposing a semi-supervised data augmentation approach using 3D graphical engines, resulting in a synthetic dataset (ScanAva) that achieves 91.2% pose estimation accuracy at PCK0.5 criteria, comparable to models trained on large real datasets like MPII.

Deep learning approaches have been rapidly adopted across a wide range of fields because of their accuracy and flexibility, but require large labeled training datasets. This presents a fundamental problem for applications with limited, expensive, or private data (i.e. small data), such as human pose and behavior estimation/tracking which could be highly personalized. In this paper, we present a semi-supervised data augmentation approach that can synthesize large scale labeled training datasets using 3D graphical engines based on a physically-valid low dimensional pose descriptor. To evaluate the performance of our synthesized datasets in training deep learning-based models, we generated a large synthetic human pose dataset, called ScanAva using 3D scans of only 7 individuals based on our proposed augmentation approach. A state-of-the-art human pose estimation deep learning model then was trained from scratch using our ScanAva dataset and could achieve the pose estimation accuracy of 91.2% at PCK0.5 criteria after applying an efficient domain adaptation on the synthetic images, in which its pose estimation accuracy was comparable to the same model trained on large scale pose data from real humans such as MPII dataset and much higher than the model trained on other synthetic human dataset such as SURREAL.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes