CVOct 4, 2021

Seeking Similarities over Differences: Similarity-based Domain Alignment for Adaptive Object Detection

arXiv:2110.01428v192 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of adapting object detectors to new domains without additional labeling, which is crucial for real-world deployment in varied scenarios, though it appears incremental by building on existing UDA methods.

The paper tackles the problem of aligning features for unsupervised domain adaptation in object detection by proposing a framework and a novel algorithm, ViSGA, which uses visual similarity and adversarial training to group and align features, achieving improved performance on Sim2Real and Adverse Weather benchmarks.

In order to robustly deploy object detectors across a wide range of scenarios, they should be adaptable to shifts in the input distribution without the need to constantly annotate new data. This has motivated research in Unsupervised Domain Adaptation (UDA) algorithms for detection. UDA methods learn to adapt from labeled source domains to unlabeled target domains, by inducing alignment between detector features from source and target domains. Yet, there is no consensus on what features to align and how to do the alignment. In our work, we propose a framework that generalizes the different components commonly used by UDA methods laying the ground for an in-depth analysis of the UDA design space. Specifically, we propose a novel UDA algorithm, ViSGA, a direct implementation of our framework, that leverages the best design choices and introduces a simple but effective method to aggregate features at instance-level based on visual similarity before inducing group alignment via adversarial training. We show that both similarity-based grouping and adversarial training allows our model to focus on coarsely aligning feature groups, without being forced to match all instances across loosely aligned domains. Finally, we examine the applicability of ViSGA to the setting where labeled data are gathered from different sources. Experiments show that not only our method outperforms previous single-source approaches on Sim2Real and Adverse Weather, but also generalizes well to the multi-source setting.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes