LGMar 11, 2025

Domain Adaptation and Entanglement: an Optimal Transport Perspective

Okan Koç, Alexander Soen, Chao-Kai Chiang, Masashi Sugiyama

arXiv:2503.08155v113.04 citationsh-index: 2AISTATS

Originality Incremental advance

AI Analysis

This work addresses the brittleness of machine learning systems under distribution shifts, providing a novel theoretical perspective for domain adaptation, though it is incremental in nature.

The paper tackles the problem of robustness to distribution shifts in unsupervised domain adaptation by deriving new theoretical bounds based on optimal transport, which include an 'entanglement' term to explain performance variations across algorithms and scenarios.

Current machine learning systems are brittle in the face of distribution shifts (DS), where the target distribution that the system is tested on differs from the source distribution used to train the system. This problem of robustness to DS has been studied extensively in the field of domain adaptation. For deep neural networks, a popular framework for unsupervised domain adaptation (UDA) is domain matching, in which algorithms try to align the marginal distributions in the feature or output space. The current theoretical understanding of these methods, however, is limited and existing theoretical results are not precise enough to characterize their performance in practice. In this paper, we derive new bounds based on optimal transport that analyze the UDA problem. Our new bounds include a term which we dub as \emph{entanglement}, consisting of an expectation of Wasserstein distance between conditionals with respect to changing data distributions. Analysis of the entanglement term provides a novel perspective on the unoptimizable aspects of UDA. In various experiments with multiple models across several DS scenarios, we show that this term can be used to explain the varying performance of UDA algorithms.

View on arXiv PDF

Similar