LG CV MLSep 25, 2019

Mixup Inference: Better Exploiting Mixup to Defend Adversarial Attacks

arXiv:1909.11515v2114 citations

Originality Incremental advance

AI Analysis

This work addresses the vulnerability of deep learning models to adversarial attacks, offering an incremental improvement by better exploiting existing mixup training techniques.

The paper tackles the problem of adversarial attacks on deep networks by proposing mixup inference (MI), which actively uses mixup during inference to break the locality of perturbations, resulting in improved adversarial robustness on CIFAR-10 and CIFAR-100 datasets.

It has been widely recognized that adversarial examples can be easily crafted to fool deep networks, which mainly root from the locally non-linear behavior nearby input examples. Applying mixup in training provides an effective mechanism to improve generalization performance and model robustness against adversarial perturbations, which introduces the globally linear behavior in-between training examples. However, in previous work, the mixup-trained models only passively defend adversarial attacks in inference by directly classifying the inputs, where the induced global linearity is not well exploited. Namely, since the locality of the adversarial perturbations, it would be more efficient to actively break the locality via the globality of the model predictions. Inspired by simple geometric intuition, we develop an inference principle, named mixup inference (MI), for mixup-trained models. MI mixups the input with other random clean samples, which can shrink and transfer the equivalent perturbation if the input is adversarial. Our experiments on CIFAR-10 and CIFAR-100 demonstrate that MI can further improve the adversarial robustness for the models trained by mixup and its variants.

View on arXiv PDF

Similar