LGCVJun 3, 2024

Mixup Augmentation with Multiple Interpolations

arXiv:2406.01417v1
Originality Incremental advance
AI Analysis

This is an incremental improvement for machine learning practitioners using data augmentation techniques.

The paper tackles the limitation of standard mixup augmentation generating only one interpolation per sample pair by proposing multi-mix, which creates multiple interpolations, resulting in improved generalization, robustness, and calibration across synthetic and large-scale datasets.

Mixup and its variants form a popular class of data augmentation techniques.Using a random sample pair, it generates a new sample by linear interpolation of the inputs and labels. However, generating only one single interpolation may limit its augmentation ability. In this paper, we propose a simple yet effective extension called multi-mix, which generates multiple interpolations from a sample pair. With an ordered sequence of generated samples, multi-mix can better guide the training process than standard mixup. Moreover, theoretically, this can also reduce the stochastic gradient variance. Extensive experiments on a number of synthetic and large-scale data sets demonstrate that multi-mix outperforms various mixup variants and non-mixup-based baselines in terms of generalization, robustness, and calibration.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes