CVDec 17, 2019

Learning from Synthetic Animals

arXiv:1912.08265v2142 citations
Originality Incremental advance
AI Analysis

This addresses the problem of limited labeled data for animal parsing in computer vision, offering a domain adaptation solution that is incremental as it builds on existing semi-supervised and synthetic data techniques.

The paper tackles the lack of labeled data for parsing deformable animals by using synthetic images from CAD models, proposing a consistency-constrained semi-supervised learning method to bridge the domain gap and achieve accurate keypoint prediction on real images without real labels, with models showing better generalization across domains in the Visual Domain Adaptation Challenge dataset.

Despite great success in human parsing, progress for parsing other deformable articulated objects, like animals, is still limited by the lack of labeled data. In this paper, we use synthetic images and ground truth generated from CAD animal models to address this challenge. To bridge the domain gap between real and synthetic images, we propose a novel consistency-constrained semi-supervised learning method (CC-SSL). Our method leverages both spatial and temporal consistencies, to bootstrap weak models trained on synthetic data with unlabeled real images. We demonstrate the effectiveness of our method on highly deformable animals, such as horses and tigers. Without using any real image label, our method allows for accurate keypoint prediction on real images. Moreover, we quantitatively show that models using synthetic data achieve better generalization performance than models trained on real images across different domains in the Visual Domain Adaptation Challenge dataset. Our synthetic dataset contains 10+ animals with diverse poses and rich ground truth, which enables us to use the multi-task learning strategy to further boost models' performance.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes