LGAIITMLDec 31, 2021

Optimal Representations for Covariate Shift

arXiv:2201.00057v279 citations
AI Analysis

This addresses robustness issues in machine learning systems for applications where data distributions change, though it is incremental in improving existing methods like CLIP.

The paper tackles the problem of distribution shift between training and testing by introducing a variational objective that ensures representations are robust to covariate shifts, achieving state-of-the-art results on DomainBed.

Machine learning systems often experience a distribution shift between training and testing. In this paper, we introduce a simple variational objective whose optima are exactly the set of all representations on which risk minimizers are guaranteed to be robust to any distribution shift that preserves the Bayes predictor, e.g., covariate shifts. Our objective has two components. First, a representation must remain discriminative for the task, i.e., some predictor must be able to simultaneously minimize the source and target risk. Second, the representation's marginal support needs to be the same across source and target. We make this practical by designing self-supervised objectives that only use unlabelled data and augmentations to train robust representations. Our objectives give insights into the robustness of CLIP, and further improve CLIP's representations to achieve SOTA results on DomainBed.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes