LGMLDec 2, 2022

Stable Learning via Sparse Variable Independence

arXiv:2212.00992v125 citationsh-index: 19
Originality Incremental advance
AI Analysis

This addresses covariate-shift generalization for machine learning applications, representing an incremental improvement over previous stable learning methods.

The paper tackles the problem of covariate-shift generalization by proposing SVI (Sparse Variable Independence), which combines sample reweighting and sparsity-based variable selection to improve performance, as demonstrated in experiments on synthetic and real-world datasets.

The problem of covariate-shift generalization has attracted intensive research attention. Previous stable learning algorithms employ sample reweighting schemes to decorrelate the covariates when there is no explicit domain information about training data. However, with finite samples, it is difficult to achieve the desirable weights that ensure perfect independence to get rid of the unstable variables. Besides, decorrelating within stable variables may bring about high variance of learned models because of the over-reduced effective sample size. A tremendous sample size is required for these algorithms to work. In this paper, with theoretical justification, we propose SVI (Sparse Variable Independence) for the covariate-shift generalization problem. We introduce sparsity constraint to compensate for the imperfectness of sample reweighting under the finite-sample setting in previous methods. Furthermore, we organically combine independence-based sample reweighting and sparsity-based variable selection in an iterative way to avoid decorrelating within stable variables, increasing the effective sample size to alleviate variance inflation. Experiments on both synthetic and real-world datasets demonstrate the improvement of covariate-shift generalization performance brought by SVI.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes