LGCVMLMay 31, 2019

Are Labels Required for Improving Adversarial Robustness?

arXiv:1905.13725v4359 citations
Originality Highly original
AI Analysis

This addresses the challenge of deploying robust machine learning models in real-world applications where labeled data is expensive, offering a practical solution by leveraging unlabeled data.

The paper tackles the problem of high labeled data requirements for training adversarially robust models by showing that unlabeled data can be a competitive alternative, achieving a 21.7% improvement in robust accuracy over using only 4K labeled examples on CIFAR-10 and a 4% improvement over the previous state-of-the-art with additional unlabeled data.

Recent work has uncovered the interesting (and somewhat surprising) finding that training models to be invariant to adversarial perturbations requires substantially larger datasets than those required for standard classification. This result is a key hurdle in the deployment of robust machine learning models in many real world applications where labeled data is expensive. Our main insight is that unlabeled data can be a competitive alternative to labeled data for training adversarially robust models. Theoretically, we show that in a simple statistical setting, the sample complexity for learning an adversarially robust model from unlabeled data matches the fully supervised case up to constant factors. On standard datasets like CIFAR-10, a simple Unsupervised Adversarial Training (UAT) approach using unlabeled data improves robust accuracy by 21.7% over using 4K supervised examples alone, and captures over 95% of the improvement from the same number of labeled examples. Finally, we report an improvement of 4% over the previous state-of-the-art on CIFAR-10 against the strongest known attack by using additional unlabeled data from the uncurated 80 Million Tiny Images dataset. This demonstrates that our finding extends as well to the more realistic case where unlabeled data is also uncurated, therefore opening a new avenue for improving adversarial training.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes