CVNov 14, 2018

Domain Randomization for Scene-Specific Car Detection and Pose Estimation

arXiv:1811.05939v152 citations
Originality Incremental advance
AI Analysis

This work addresses the domain gap issue for researchers and practitioners in computer vision who rely on synthetic data for training scene-specific object detection and pose estimation models, representing an incremental improvement over existing methods.

The paper tackles the domain gap problem when using synthetic data to train scene-specific car detection and pose estimation models by proposing appearance randomization to generate diverse synthetic objects that better match real-world data distributions. It demonstrates that this approach outperforms fine-tuning off-the-shelf models on limited real data across VIRAT, UA-DETRAC, and EPFL-Car datasets.

We address the issue of domain gap when making use of synthetic data to train a scene-specific object detector and pose estimator. While previous works have shown that the constraints of learning a scene-specific model can be leveraged to create geometrically and photometrically consistent synthetic data, care must be taken to design synthetic content which is as close as possible to the real-world data distribution. In this work, we propose to solve domain gap through the use of appearance randomization to generate a wide range of synthetic objects to span the space of realistic images for training. An ablation study of our results is presented to delineate the individual contribution of different components in the randomization process. We evaluate our method on VIRAT, UA-DETRAC, EPFL-Car datasets, where we demonstrate that using scene specific domain randomized synthetic data is better than fine-tuning off-the-shelf models on limited real data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes