LGOct 6, 2023

Generating Less Certain Adversarial Examples Improves Robust Generalization

arXiv:2310.04539v43 citationsh-index: 2Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of robust overfitting in adversarial training for machine learning practitioners, offering an incremental improvement by modifying adversarial example generation to reduce certainty.

The paper tackles robust overfitting in adversarial training by hypothesizing that overconfidence in predicting adversarial examples harms robust generalization, and proposes generating less certain adversarial examples to improve it, achieving consistently improved robustness in experiments on image benchmarks.

This paper revisits the robust overfitting phenomenon of adversarial training. Observing that models with better robust generalization performance are less certain in predicting adversarially generated training inputs, we argue that overconfidence in predicting adversarial examples is a potential cause. Therefore, we hypothesize that generating less certain adversarial examples improves robust generalization, and propose a formal definition of adversarial certainty that captures the variance of the model's predicted logits on adversarial examples. Our theoretical analysis of synthetic distributions characterizes the connection between adversarial certainty and robust generalization. Accordingly, built upon the notion of adversarial certainty, we develop a general method to search for models that can generate training-time adversarial inputs with reduced certainty, while maintaining the model's capability in distinguishing adversarial examples. Extensive experiments on image benchmarks demonstrate that our method effectively learns models with consistently improved robustness and mitigates robust overfitting, confirming the importance of generating less certain adversarial examples for robust generalization. Our implementations are available as open-source code at: https://github.com/TrustMLRG/AdvCertainty.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes