MLLGApr 12, 2018

Causal Generative Domain Adaptation Networks

arXiv:1804.04333v322 citations
Originality Incremental advance
AI Analysis

This addresses domain adaptation for machine learning applications where data distributions shift, offering a method to generate new domain data and improve cross-domain prediction, though it appears incremental as it builds on existing generative and causal modeling approaches.

The paper tackles the problem of domain adaptation by modeling distribution changes across domains with a generative network (G-DAN) and improves it using causal representations to create CG-DAN, enhancing efficiency and enabling data generation in new domains, with experiments showing efficacy in synthetic and real data.

An essential problem in domain adaptation is to understand and make use of distribution changes across domains. For this purpose, we first propose a flexible Generative Domain Adaptation Network (G-DAN) with specific latent variables to capture changes in the generating process of features across domains. By explicitly modeling the changes, one can even generate data in new domains using the generating process with new values for the latent variables in G-DAN. In practice, the process to generate all features together may involve high-dimensional latent variables, requiring dealing with distributions in high dimensions and making it difficult to learn domain changes from few source domains. Interestingly, by further making use of the causal representation of joint distributions, we then decompose the joint distribution into separate modules, each of which involves different low-dimensional latent variables and can be learned separately, leading to a Causal G-DAN (CG-DAN). This improves both statistical and computational efficiency of the learning procedure. Finally, by matching the feature distribution in the target domain, we can recover the target-domain joint distribution and derive the learning machine for the target domain. We demonstrate the efficacy of both G-DAN and CG-DAN in domain generation and cross-domain prediction on both synthetic and real data experiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes