MLJul 19, 2015

Invariant Models for Causal Transfer Learning

Mateo Rojas-Carulla, Bernhard Schölkopf, Richard Turner, Jonas Peters

arXiv:1507.05333v438.4111 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses domain generalization for machine learning practitioners by providing a causal-inspired method that relaxes covariate shift assumptions, though it is incremental as it builds on existing causal and transfer learning frameworks.

The paper tackles the problem of Domain Generalization by assuming an invariant conditional distribution of the target given a subset of predictors across tasks, motivated by causal ideas. It proves optimality in adversarial settings and shows improved performance over data pooling on synthetic and gene deletion datasets, with specific examples demonstrating gains.

Methods of transfer learning try to combine knowledge from several related tasks (or domains) to improve performance on a test task. Inspired by causal methodology, we relax the usual covariate shift assumption and assume that it holds true for a subset of predictor variables: the conditional distribution of the target variable given this subset of predictors is invariant over all tasks. We show how this assumption can be motivated from ideas in the field of causality. We focus on the problem of Domain Generalization, in which no examples from the test task are observed. We prove that in an adversarial setting using this subset for prediction is optimal in Domain Generalization; we further provide examples, in which the tasks are sufficiently diverse and the estimator therefore outperforms pooling the data, even on average. If examples from the test task are available, we also provide a method to transfer knowledge from the training tasks and exploit all available features for prediction. However, we provide no guarantees for this method. We introduce a practical method which allows for automatic inference of the above subset and provide corresponding code. We present results on synthetic data sets and a gene deletion data set.

View on arXiv PDF Code

Similar