CVNov 14, 2018

Domain Randomization for Scene-Specific Car Detection and Pose Estimation

Rawal Khirodkar, Donghyun Yoo, Kris M. Kitani

arXiv:1811.05939v112.452 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the domain gap issue for researchers and practitioners in computer vision who rely on synthetic data for training scene-specific object detection and pose estimation models, representing an incremental improvement over existing methods.

The paper tackles the domain gap problem when using synthetic data to train scene-specific car detection and pose estimation models by proposing appearance randomization to generate diverse synthetic objects that better match real-world data distributions. It demonstrates that this approach outperforms fine-tuning off-the-shelf models on limited real data across VIRAT, UA-DETRAC, and EPFL-Car datasets.

We address the issue of domain gap when making use of synthetic data to train a scene-specific object detector and pose estimator. While previous works have shown that the constraints of learning a scene-specific model can be leveraged to create geometrically and photometrically consistent synthetic data, care must be taken to design synthetic content which is as close as possible to the real-world data distribution. In this work, we propose to solve domain gap through the use of appearance randomization to generate a wide range of synthetic objects to span the space of realistic images for training. An ablation study of our results is presented to delineate the individual contribution of different components in the randomization process. We evaluate our method on VIRAT, UA-DETRAC, EPFL-Car datasets, where we demonstrate that using scene specific domain randomized synthetic data is better than fine-tuning off-the-shelf models on limited real data.

View on arXiv PDF Code

Similar