LGCVSep 2, 2021

Adversarial Robustness for Unsupervised Domain Adaptation

arXiv:2109.00946v147 citations
Originality Incremental advance
AI Analysis

This addresses the lack of adversarial robustness in UDA models, which is crucial for real-world applications, but it is incremental as it builds on existing UDA and robustness methods.

The paper tackles the problem of adversarial robustness in unsupervised domain adaptation (UDA) by aligning features with robust ImageNet models, resulting in significant improvements in robustness while maintaining clean accuracy on benchmarks.

Extensive Unsupervised Domain Adaptation (UDA) studies have shown great success in practice by learning transferable representations across a labeled source domain and an unlabeled target domain with deep models. However, previous works focus on improving the generalization ability of UDA models on clean examples without considering the adversarial robustness, which is crucial in real-world applications. Conventional adversarial training methods are not suitable for the adversarial robustness on the unlabeled target domain of UDA since they train models with adversarial examples generated by the supervised loss function. In this work, we leverage intermediate representations learned by multiple robust ImageNet models to improve the robustness of UDA models. Our method works by aligning the features of the UDA model with the robust features learned by ImageNet pre-trained models along with domain adaptation training. It utilizes both labeled and unlabeled domains and instills robustness without any adversarial intervention or label requirement during domain adaptation training. Experimental results show that our method significantly improves adversarial robustness compared to the baseline while keeping clean accuracy on various UDA benchmarks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes