CVApr 29, 2022

Improving Transferability for Domain Adaptive Detection Transformers

arXiv:2204.14195v349 citationsh-index: 74
Originality Incremental advance
AI Analysis

This work addresses domain adaptation for object detection transformers, which is an incremental improvement for computer vision applications.

The paper tackles the problem of domain shift in DETR-style object detectors by proposing two alignment modules (Object-Aware Alignment and Optimal Transport based Alignment) to improve transferability, achieving favorable results across various domain adaptive scenarios.

DETR-style detectors stand out amongst in-domain scenarios, but their properties in domain shift settings are under-explored. This paper aims to build a simple but effective baseline with a DETR-style detector on domain shift settings based on two findings. For one, mitigating the domain shift on the backbone and the decoder output features excels in getting favorable results. For another, advanced domain alignment methods in both parts further enhance the performance. Thus, we propose the Object-Aware Alignment (OAA) module and the Optimal Transport based Alignment (OTA) module to achieve comprehensive domain alignment on the outputs of the backbone and the detector. The OAA module aligns the foreground regions identified by pseudo-labels in the backbone outputs, leading to domain-invariant based features. The OTA module utilizes sliced Wasserstein distance to maximize the retention of location information while minimizing the domain gap in the decoder outputs. We implement the findings and the alignment modules into our adaptation method, and it benchmarks the DETR-style detector on the domain shift settings. Experiments on various domain adaptive scenarios validate the effectiveness of our method.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes