LGMLSep 9, 2021

Discovery of New Multi-Level Features for Domain Generalization via Knowledge Corruption

arXiv:2109.04320v3
Originality Highly original
AI Analysis

This addresses the challenge of machine learning models generalizing to unseen domains, which is critical for real-world applications with domain shifts, representing a novel method for a known bottleneck.

The paper tackles the domain generalization problem by proposing COLUMBUS, a method that discovers new multi-level features through targeted corruption, achieving state-of-the-art results by outperforming 18 algorithms on multiple benchmark datasets in DomainBed.

Machine learning models that can generalize to unseen domains are essential when applied in real-world scenarios involving strong domain shifts. We address the challenging domain generalization (DG) problem, where a model trained on a set of source domains is expected to generalize well in unseen domains without any exposure to their data. The main challenge of DG is that the features learned from the source domains are not necessarily present in the unseen target domains, leading to performance deterioration. We assume that learning a richer set of features is crucial to improve the transfer to a wider set of unknown domains. For this reason, we propose COLUMBUS, a method that enforces new feature discovery via a targeted corruption of the most relevant input and multi-level representations of the data. We conduct an extensive empirical evaluation to demonstrate the effectiveness of the proposed approach which achieves new state-of-the-art results by outperforming 18 DG algorithms on multiple DG benchmark datasets in the DomainBed framework.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes