CVApr 19, 2022

Image Data Augmentation for Deep Learning: A Survey

arXiv:2204.08610v2386 citationsh-index: 23
Originality Synthesis-oriented
AI Analysis

This is an incremental survey that provides a structured overview for researchers and practitioners in computer vision dealing with data scarcity.

The paper systematically reviews image data augmentation methods for deep learning, proposing a taxonomy and evaluating them on three computer vision tasks to address limited labeled data.

Deep learning has achieved remarkable results in many computer vision tasks. Deep neural networks typically rely on large amounts of training data to avoid overfitting. However, labeled data for real-world applications may be limited. By improving the quantity and diversity of training data, data augmentation has become an inevitable part of deep learning model training with image data. As an effective way to improve the sufficiency and diversity of training data, data augmentation has become a necessary part of successful application of deep learning models on image data. In this paper, we systematically review different image data augmentation methods. We propose a taxonomy of reviewed methods and present the strengths and limitations of these methods. We also conduct extensive experiments with various data augmentation methods on three typical computer vision tasks, including semantic segmentation, image classification and object detection. Finally, we discuss current challenges faced by data augmentation and future research directions to put forward some useful research guidance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes