CVAIFeb 7, 2023

Effective Data Augmentation With Diffusion Models

arXiv:2302.07944v3392 citationsh-index: 119
AI Analysis

This addresses the need for more diverse data augmentation in deep learning, particularly for tasks requiring semantic variations, but it is incremental as it builds on existing diffusion models.

The paper tackles the problem of limited diversity in data augmentation by using pre-trained text-to-image diffusion models to edit images and change high-level semantic attributes, resulting in improved accuracy on few-shot image classification and a real-world weed recognition task.

Data augmentation is one of the most prevalent tools in deep learning, underpinning many recent advances, including those from classification, generative models, and representation learning. The standard approach to data augmentation combines simple transformations like rotations and flips to generate new images from existing ones. However, these new images lack diversity along key semantic axes present in the data. Current augmentations cannot alter the high-level semantic attributes, such as animal species present in a scene, to enhance the diversity of data. We address the lack of diversity in data augmentation with image-to-image transformations parameterized by pre-trained text-to-image diffusion models. Our method edits images to change their semantics using an off-the-shelf diffusion model, and generalizes to novel visual concepts from a few labelled examples. We evaluate our approach on few-shot image classification tasks, and on a real-world weed recognition task, and observe an improvement in accuracy in tested domains.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes