CV · Dec 3, 2024

GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing

arXiv:2412.02366v4 · 11 citations · h-index: 8
Originality: Incremental advance
AI Analysis

The paper addresses the challenge of domain gaps in data augmentation for visual classification and reports improvements in generalization, adversarial robustness, and data-scarce settings. It appears incremental, however, because it builds on existing generative methods.

The paper tackles data augmentation for visual classification when source and target domains differ. It introduces GenMix, a prompt-guided generative approach that improves performance in both in-domain and cross-domain settings and reports stronger results than state-of-the-art methods across eight datasets.

Data augmentation is widely used to enhance generalization in visual classification tasks. However, traditional methods struggle when source and target domains differ, as in domain adaptation, due to their inability to address domain gaps. This paper introduces GenMix, a generalizable prompt-guided generative data augmentation approach that enhances both in-domain and cross-domain image classification. Our technique leverages image editing to generate augmented images based on custom conditional prompts, designed specifically for each problem type. By blending portions of the input image with its edited generative counterpart and incorporating fractal patterns, our approach mitigates unrealistic images and label ambiguity, improving the performance and adversarial robustness of the resulting models. Efficacy of our method is established with extensive experiments on eight public datasets for general and fine-grained classification, in both in-domain and cross-domain settings. Additionally, we demonstrate performance improvements for self-supervised learning, learning with data scarcity, and adversarial robustness. As compared to the existing state-of-the-art methods, our technique achieves stronger performance across the board.
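The blending step described in the abstract can be pictured with a minimal sketch. This is an illustration under stated assumptions, not the authors' implementation: the function name `blend_with_edit`, the rectangular half-image mask, and the `split` and `fractal_weight` parameters and their default values are all hypothetical, and the diffusion-based editing that produces `edited` is assumed to have already run elsewhere.

```python
import numpy as np


def blend_with_edit(original, edited, fractal,
                    split=0.5, fractal_weight=0.2, vertical=True):
    """Hypothetical sketch: keep part of the original image, fill the rest
    from its generatively edited counterpart, then mix in a fractal pattern.

    All arrays are H x W x C floats in [0, 1]. The mask layout and the
    default weights are assumptions made for illustration only.
    """
    if not (original.shape == edited.shape == fractal.shape):
        raise ValueError("all images must share one shape")

    h, w = original.shape[:2]

    # Binary mask: 1 where pixels come from the original image,
    # 0 where they come from the edited image.
    mask = np.zeros((h, w, 1), dtype=np.float32)
    if vertical:
        mask[:, : int(w * split)] = 1.0   # left part from the original
    else:
        mask[: int(h * split), :] = 1.0   # top part from the original

    # Hybrid image: a portion of the input joined with its edited counterpart.
    hybrid = mask * original + (1.0 - mask) * edited

    # Convex combination with the fractal pattern.
    return (1.0 - fractal_weight) * hybrid + fractal_weight * fractal
```

Keeping a region of the original image unchanged is one plausible reading of how the blend limits label ambiguity: part of every augmented sample still shows the true object even when the edited half drifts.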

