LGCVIVMLNov 25, 2019

One Man's Trash is Another Man's Treasure: Resisting Adversarial Examples by Adversarial Examples

arXiv:1911.11219v222 citations
Originality Highly original
AI Analysis

This addresses the security problem of adversarial attacks in image classification for AI systems, offering a novel defense approach that is more robust and efficient than existing methods.

The paper tackles the vulnerability of deep neural networks to adversarial examples by proposing a defense that transforms input images into adversarial examples using a pre-trained external model, achieving significantly higher robustness on CIFAR-10 and Tiny ImageNet datasets compared to state-of-the-art methods with lower training costs.

Modern image classification systems are often built on deep neural networks, which suffer from adversarial examples--images with deliberately crafted, imperceptible noise to mislead the network's classification. To defend against adversarial examples, a plausible idea is to obfuscate the network's gradient with respect to the input image. This general idea has inspired a long line of defense methods. Yet, almost all of them have proven vulnerable. We revisit this seemingly flawed idea from a radically different perspective. We embrace the omnipresence of adversarial examples and the numerical procedure of crafting them, and turn this harmful attacking process into a useful defense mechanism. Our defense method is conceptually simple: before feeding an input image for classification, transform it by finding an adversarial example on a pre-trained external model. We evaluate our method against a wide range of possible attacks. On both CIFAR-10 and Tiny ImageNet datasets, our method is significantly more robust than state-of-the-art methods. Particularly, in comparison to adversarial training, our method offers lower training cost as well as stronger robustness.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes