CVCRLGSep 6, 2021

Robustness and Generalization via Generative Adversarial Training

arXiv:2109.02765v135 citations
Originality Highly original
AI Analysis

This addresses robustness and generalization issues in computer vision models, offering a method that is not incremental but aims for broad improvements across tasks like classification, segmentation, and object detection.

The paper tackles the problem of deep neural networks failing to generalize to new domains and input variations by proposing Generative Adversarial Training, which improves model performance on clean images and out-of-domain samples while enhancing robustness against unforeseen attacks, outperforming prior work.

While deep neural networks have achieved remarkable success in various computer vision tasks, they often fail to generalize to new domains and subtle variations of input images. Several defenses have been proposed to improve the robustness against these variations. However, current defenses can only withstand the specific attack used in training, and the models often remain vulnerable to other input variations. Moreover, these methods often degrade performance of the model on clean images and do not generalize to out-of-domain samples. In this paper we present Generative Adversarial Training, an approach to simultaneously improve the model's generalization to the test set and out-of-domain samples as well as its robustness to unseen adversarial attacks. Instead of altering a low-level pre-defined aspect of images, we generate a spectrum of low-level, mid-level and high-level changes using generative models with a disentangled latent space. Adversarial training with these examples enable the model to withstand a wide range of attacks by observing a variety of input alterations during training. We show that our approach not only improves performance of the model on clean images and out-of-domain samples but also makes it robust against unforeseen attacks and outperforms prior work. We validate effectiveness of our method by demonstrating results on various tasks such as classification, segmentation and object detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes