LG AI MLSep 28, 2023

Discovering environments with XRM

Mohammad Pezeshki, Diane Bouchacourt, Mark Ibrahim, Nicolas Ballas, Pascal Vincent, David Lopez-Paz

arXiv:2309.16748v218.022 citationsh-index: 34Has Code

Originality Highly original

AI Analysis

This addresses a long-standing challenge in OOD generalization by enabling robust algorithms without costly human annotations, though it is incremental as it builds on prior environment-based methods.

The paper tackles the problem of automatic environment discovery for out-of-distribution generalization by proposing Cross-Risk-Minimization (XRM), which trains twin networks to imitate each other's confident mistakes without needing human-annotated environments, achieving oracle worst-group-accuracy.

Environment annotations are essential for the success of many out-of-distribution (OOD) generalization methods. Unfortunately, these are costly to obtain and often limited by human annotators' biases. To achieve robust generalization, it is essential to develop algorithms for automatic environment discovery within datasets. Current proposals, which divide examples based on their training error, suffer from one fundamental problem. These methods introduce hyper-parameters and early-stopping criteria, which require a validation set with human-annotated environments, the very information subject to discovery. In this paper, we propose Cross-Risk-Minimization (XRM) to address this issue. XRM trains twin networks, each learning from one random half of the training data, while imitating confident held-out mistakes made by its sibling. XRM provides a recipe for hyper-parameter tuning, does not require early-stopping, and can discover environments for all training and validation data. Algorithms built on top of XRM environments achieve oracle worst-group-accuracy, addressing a long-standing challenge in OOD generalization. Code available at \url{https://github.com/facebookresearch/XRM}.

View on arXiv PDF Code

Similar