LGAIJan 25, 2022

Conditional entropy minimization principle for learning domain invariant representation features

arXiv:2201.10460v48 citations
Originality Incremental advance
AI Analysis

This work addresses domain generalization for machine learning applications, presenting an incremental improvement over existing methods.

The paper tackles the problem of domain generalization by addressing the mixing of true and spurious invariant features in invariance-principle-based methods, proposing a conditional entropy minimization framework that achieves competitive classification accuracy across several datasets.

Invariance-principle-based methods such as Invariant Risk Minimization (IRM), have recently emerged as promising approaches for Domain Generalization (DG). Despite promising theory, such approaches fail in common classification tasks due to the mixing of true invariant features and spurious invariant features. To address this, we propose a framework based on the conditional entropy minimization (CEM) principle to filter-out the spurious invariant features leading to a new algorithm with a better generalization capability. We show that our proposed approach is closely related to the well-known Information Bottleneck (IB) framework and prove that under certain assumptions, entropy minimization can exactly recover the true invariant features. Our approach provides competitive classification accuracy compared to recent theoretically-principled state-of-the-art alternatives across several DG datasets.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes