LGMLFeb 13, 2020

Learn to Expect the Unexpected: Probably Approximately Correct Domain Generalization

arXiv:2002.05660v123 citations
AI Analysis

This work addresses the problem of machine learning models failing to generalize across unseen data domains, which is critical for real-world applications, but it appears incremental as it builds on existing domain generalization frameworks.

The paper tackles domain generalization by proposing a theoretical model where training data consists of multiple datasets from different domains drawn from a meta-distribution, and shows that computationally efficient, polynomial-sample domain generalization is possible in three problem settings, with experiments demonstrating improved generalization by ignoring spurious correlations.

Domain generalization is the problem of machine learning when the training data and the test data come from different data domains. We present a simple theoretical model of learning to generalize across domains in which there is a meta-distribution over data distributions, and those data distributions may even have different supports. In our model, the training data given to a learning algorithm consists of multiple datasets each from a single domain drawn in turn from the meta-distribution. We study this model in three different problem settings---a multi-domain Massart noise setting, a decision tree multi-dataset setting, and a feature selection setting, and find that computationally efficient, polynomial-sample domain generalization is possible in each. Experiments demonstrate that our feature selection algorithm indeed ignores spurious correlations and improves generalization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes