LGDec 24, 2021

Disentanglement by Cyclic Reconstruction

arXiv:2112.12980v25.55 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses domain adaptation for machine learning models, but it is incremental as it builds on existing disentanglement and adversarial methods.

The paper tackles the problem of domain-specific bias in supervised learning by splitting information into task-related and context representations, using adversarial feature predictors and cyclic reconstruction to disentangle them, and demonstrates improved generalization and domain adaptation performance on benchmarks.

Deep neural networks have demonstrated their ability to automatically extract meaningful features from data. However, in supervised learning, information specific to the dataset used for training, but irrelevant to the task at hand, may remain encoded in the extracted representations. This remaining information introduces a domain-specific bias, weakening the generalization performance. In this work, we propose splitting the information into a task-related representation and its complementary context representation. We propose an original method, combining adversarial feature predictors and cyclic reconstruction, to disentangle these two representations in the single-domain supervised case. We then adapt this method to the unsupervised domain adaptation problem, consisting of training a model capable of performing on both a source and a target domain. In particular, our method promotes disentanglement in the target domain, despite the absence of training labels. This enables the isolation of task-specific information from both domains and a projection into a common representation. The task-specific representation allows efficient transfer of knowledge acquired from the source domain to the target domain. In the single-domain case, we demonstrate the quality of our representations on information retrieval tasks and the generalization benefits induced by sharpened task-specific representations. We then validate the proposed method on several classical domain adaptation benchmarks and illustrate the benefits of disentanglement for domain adaptation.

View on arXiv PDF Code

Similar