Anti-causal domain generalization: Leveraging unlabeled data
This work addresses domain generalization for scenarios with scarce labeled data, offering a novel approach that leverages unlabeled data to improve robustness to distribution shifts.
The paper tackles domain generalization in anti-causal settings by proposing methods that regularize model sensitivity to covariate perturbations using unlabeled data, achieving worst-case optimality guarantees and demonstrating empirical performance on controlled and physiological datasets.
The problem of domain generalization concerns learning predictive models that are robust to distribution shifts when deployed in new, previously unseen environments. Existing methods typically require labeled data from multiple training environments, limiting their applicability when labeled data are scarce. In this work, we study domain generalization in an anti-causal setting, where the outcome causes the observed covariates. Under this structure, environment perturbations that affect the covariates do not propagate to the outcome, which motivates regularizing the model's sensitivity to these perturbations. Crucially, estimating these perturbation directions does not require labels, enabling us to leverage unlabeled data from multiple environments. We propose two methods that penalize the model's sensitivity to variations in the mean and covariance of the covariates across environments, respectively, and prove that these methods have worst-case optimality guarantees under certain classes of environments. Finally, we demonstrate the empirical performance of our approach on a controlled physical system and a physiological signal dataset.