LGJun 14, 2021

KL Guided Domain Adaptation

A. Tuan Nguyen, Toan Tran, Yarin Gal, Philip H. S. Torr, Atılım Güneş Baydin

arXiv:2106.07780v217.961 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the problem of unstable and expensive domain adaptation methods for practitioners, offering a more efficient and stable approach.

The paper tackles domain adaptation by deriving a generalization bound based on reverse KL divergence and proposes an efficient algorithm that minimizes this term to improve target domain performance, showing it outperforms other representation-alignment methods in experiments.

Domain adaptation is an important problem and often needed for real-world applications. In this problem, instead of i.i.d. training and testing datapoints, we assume that the source (training) data and the target (testing) data have different distributions. With that setting, the empirical risk minimization training procedure often does not perform well, since it does not account for the change in the distribution. A common approach in the domain adaptation literature is to learn a representation of the input that has the same (marginal) distribution over the source and the target domain. However, these approaches often require additional networks and/or optimizing an adversarial (minimax) objective, which can be very expensive or unstable in practice. To improve upon these marginal alignment techniques, in this paper, we first derive a generalization bound for the target loss based on the training loss and the reverse Kullback-Leibler (KL) divergence between the source and the target representation distributions. Based on this bound, we derive an algorithm that minimizes the KL term to obtain a better generalization to the target domain. We show that with a probabilistic representation network, the KL term can be estimated efficiently via minibatch samples without any additional network or a minimax objective. This leads to a theoretically sound alignment method which is also very efficient and stable in practice. Experimental results also suggest that our method outperforms other representation-alignment approaches.

View on arXiv PDF Code

Similar