CVJul 18, 2024

CycleMix: Mixing Source Domains for Domain Generalization in Style-Dependent Data

arXiv:2407.13421v2h-index: 8
Originality Incremental advance
AI Analysis

This addresses the issue of spurious style-class correlations in training data for machine learning systems, though it is incremental as it builds on existing CycleGAN and domain generalization methods.

The paper tackled the domain generalization problem in image classification by training a robust feature extractor to disregard style-dependent features and use style-invariant representations, achieving validation on the PACS benchmark.

As deep learning-based systems have become an integral part of everyday life, limitations in their generalization ability have begun to emerge. Machine learning algorithms typically rely on the i.i.d. assumption, meaning that their training and validation data are expected to follow the same distribution, which does not necessarily hold in practice. In the case of image classification, one frequent reason that algorithms fail to generalize is that they rely on spurious correlations present in training data, such as associating image styles with target classes. These associations may not be present in the unseen test data, leading to significant degradation of their effectiveness. In this work, we attempt to mitigate this Domain Generalization (DG) problem by training a robust feature extractor which disregards features attributed to image-style but infers based on style-invariant image representations. To achieve this, we train CycleGAN models to learn the different styles present in the training data and randomly mix them together to create samples with novel style attributes to improve generalization. Experimental results on the PACS DG benchmark validate the proposed method.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes