CV · Feb 12, 2022

Domain-Invariant Proposals based on a Balanced Domain Classifier for Object Detection

Zhize Wu, Xiaofeng Wang, Tong Xu, Xuebin Yang, Le Zou, Lixiang Xu, Thomas Weise

arXiv:2202.05941v21.41 citations

Originality: Incremental advance

AI Analysis

This work addresses domain shift in object detection. It is an incremental improvement over existing domain adaptation methods.

The paper adds a region-level domain adaptation component to Faster R-CNN: a balanced domain classifier, trained adversarially, is embedded in the region proposal network so that region proposals stay accurate across domains. Experiments on four standard datasets are reported to show that the approach is effective and robust under domain shift.

Object recognition from images means to automatically find object(s) of interest and to return their category and location information. Benefiting from research on deep learning, like convolutional neural networks (CNNs) and generative adversarial networks, the performance in this field has been improved significantly, especially when training and test data are drawn from similar distributions. However, mismatching distributions, i.e., domain shifts, lead to a significant performance drop. In this paper, we build domain-invariant detectors by learning domain classifiers via adversarial training. Based on the previous works that align image and instance level features, we mitigate the domain shift further by introducing a domain adaptation component at the region level within Faster R-CNN. We embed a domain classification network in the region proposal network (RPN) using adversarial learning. The RPN can now generate accurate region proposals in different domains by effectively aligning the features between them. To mitigate the unstable convergence during the adversarial learning, we introduce a balanced domain classifier as well as a network learning rate adjustment strategy. We conduct comprehensive experiments using four standard datasets. The results demonstrate the effectiveness and robustness of our object detection approach in domain shift scenarios.
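The abstract does not spell out how the balanced domain classifier or the learning rate adjustment are defined, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes that "balanced" means source and target samples contribute equally to the domain loss regardless of how many of each appear in a batch, and it borrows the ramp-up schedule commonly used with gradient reversal in adversarial domain adaptation as a stand-in for stabilising training. The names `grl_coefficient` and `balanced_domain_loss` are hypothetical.

```python
import math


def grl_coefficient(progress: float, gamma: float = 10.0) -> float:
    """Adversarial weight that ramps from 0 towards 1 as training progresses.

    Assumption: this is the schedule often paired with gradient reversal,
    not the adjustment strategy described in the paper.
    `progress` is the fraction of training completed, in [0, 1].
    """
    return 2.0 / (1.0 + math.exp(-gamma * progress)) - 1.0


def balanced_domain_loss(probs, labels, eps: float = 1e-7) -> float:
    """Binary cross-entropy averaged per domain, then across domains.

    Assumption: one plausible reading of "balanced". `probs` holds the
    classifier's predicted probability that a region feature comes from
    the target domain; `labels` holds 0 (source) or 1 (target).
    """
    per_domain = {0: [], 1: []}
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        per_domain[y].append(-math.log(p if y == 1 else 1.0 - p))
    # Average inside each domain first so the larger domain cannot dominate.
    means = [sum(v) / len(v) for v in per_domain.values() if v]
    return sum(means) / len(means)


# Three source regions and one target region, all scored 0.9 "target".
loss = balanced_domain_loss([0.9, 0.9, 0.9, 0.9], [0, 0, 0, 1])
weight = grl_coefficient(0.5)
```

With a plain mean over the batch, the three misclassified source regions would outweigh the single target region. Averaging per domain first gives each domain half of the loss, which is the kind of imbalance correction this sketch assumes.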
