CV LGJun 1, 2023

Maximizing Information in Domain-Invariant Representation Improves Transfer Learning

Adrian Shuai Li, Elisa Bertino, Xuan-Hong Dang, Ankush Singla, Yuhai Tu, Mark N Wegman

arXiv:2306.00262v53.92 citationsh-index: 24

Originality Incremental advance

AI Analysis

This addresses domain adaptation for transfer learning, with incremental improvements over existing methods like Domain-Separation Networks.

The paper tackles the problem of domain adaptation by improving the separation of domain-independent and domain-dependent components in data representations, showing that MaxDIRep achieves strong performance on standard benchmarks and generalizes to non-image tasks.

We propose MaxDIRep, a domain adaptation method that improves the decomposition of data representations into domain-independent and domain-dependent components. Existing methods, such as Domain-Separation Networks (DSN), use a weak orthogonality constraint between these components, which can lead to label-relevant features being partially encoded in the domain-dependent representation (DDRep) rather than the domain-independent representation (DIRep). As a result, information crucial for target-domain classification may be missing from the DIRep. MaxDIRep addresses this issue by applying a Kullback-Leibler (KL) divergence constraint to minimize the information content of the DDRep, thereby encouraging the DIRep to retain features that are both domain-invariant and predictive of target labels. Through geometric analysis and an ablation study on synthetic datasets, we show why DSN's weaker constraint can lead to suboptimal adaptation. Experiments on standard image benchmarks and a network intrusion detection task demonstrate that MaxDIRep achieves strong performance, works with pretrained models, and generalizes to non-image classification tasks.

View on arXiv PDF

Similar