LGOct 19, 2021

Learning Representations that Support Robust Transfer of Predictors

arXiv:2110.09940v113.128 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the problem of domain shift for machine learning practitioners by providing an incremental improvement over existing methods like IRM.

The paper tackles the challenge of domain shift degrading model performance in unseen environments by introducing a robust estimation criterion called transfer risk, which optimizes representations for better out-of-distribution generalization and outperforms baselines in experiments.

Ensuring generalization to unseen environments remains a challenge. Domain shift can lead to substantially degraded performance unless shifts are well-exercised within the available training environments. We introduce a simple robust estimation criterion -- transfer risk -- that is specifically geared towards optimizing transfer to new environments. Effectively, the criterion amounts to finding a representation that minimizes the risk of applying any optimal predictor trained on one environment to another. The transfer risk essentially decomposes into two terms, a direct transfer term and a weighted gradient-matching term arising from the optimality of per-environment predictors. Although inspired by IRM, we show that transfer risk serves as a better out-of-distribution generalization criterion, both theoretically and empirically. We further demonstrate the impact of optimizing such transfer risk on two controlled settings, each representing a different pattern of environment shift, as well as on two real-world datasets. Experimentally, the approach outperforms baselines across various out-of-distribution generalization tasks. Code is available at \url{https://github.com/Newbeeer/TRM}.

View on arXiv PDF Code

Similar