LGAIMLSep 28, 2023

Discovering environments with XRM

arXiv:2309.16748v222 citationsh-index: 34Has Code
Originality Highly original
AI Analysis

This addresses a long-standing challenge in OOD generalization by enabling robust algorithms without costly human annotations, though it is incremental as it builds on prior environment-based methods.

The paper tackles the problem of automatic environment discovery for out-of-distribution generalization by proposing Cross-Risk-Minimization (XRM), which trains twin networks to imitate each other's confident mistakes without needing human-annotated environments, achieving oracle worst-group-accuracy.

Environment annotations are essential for the success of many out-of-distribution (OOD) generalization methods. Unfortunately, these are costly to obtain and often limited by human annotators' biases. To achieve robust generalization, it is essential to develop algorithms for automatic environment discovery within datasets. Current proposals, which divide examples based on their training error, suffer from one fundamental problem. These methods introduce hyper-parameters and early-stopping criteria, which require a validation set with human-annotated environments, the very information subject to discovery. In this paper, we propose Cross-Risk-Minimization (XRM) to address this issue. XRM trains twin networks, each learning from one random half of the training data, while imitating confident held-out mistakes made by its sibling. XRM provides a recipe for hyper-parameter tuning, does not require early-stopping, and can discover environments for all training and validation data. Algorithms built on top of XRM environments achieve oracle worst-group-accuracy, addressing a long-standing challenge in OOD generalization. Code available at \url{https://github.com/facebookresearch/XRM}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes