LG AI MLJan 27, 2019

On Learning Invariant Representation for Domain Adaptation

Han Zhao, Remi Tachet des Combes, Kun Zhang, Geoffrey J. Gordon

arXiv:1901.09453v226.7161 citations

Originality Highly original

AI Analysis

This work addresses a core theoretical gap in domain adaptation for machine learning practitioners, offering insights that could guide future algorithm design.

The paper demonstrates that learning domain-invariant features is insufficient for successful domain adaptation due to conditional shift, and provides theoretical bounds and experiments to characterize a fundamental tradeoff between invariance and joint error.

Due to the ability of deep neural nets to learn rich representations, recent advances in unsupervised domain adaptation have focused on learning domain-invariant features that achieve a small error on the source domain. The hope is that the learnt representation, together with the hypothesis learnt from the source domain, can generalize to the target domain. In this paper, we first construct a simple counterexample showing that, contrary to common belief, the above conditions are not sufficient to guarantee successful domain adaptation. In particular, the counterexample exhibits \emph{conditional shift}: the class-conditional distributions of input features change between source and target domains. To give a sufficient condition for domain adaptation, we propose a natural and interpretable generalization upper bound that explicitly takes into account the aforementioned shift. Moreover, we shed new light on the problem by proving an information-theoretic lower bound on the joint error of \emph{any} domain adaptation method that attempts to learn invariant representations. Our result characterizes a fundamental tradeoff between learning invariant representations and achieving small joint error on both domains when the marginal label distributions differ from source to target. Finally, we conduct experiments on real-world datasets that corroborate our theoretical findings. We believe these insights are helpful in guiding the future design of domain adaptation and representation learning algorithms.

View on arXiv PDF

Similar