CVSep 17, 2020

Label Smoothing and Adversarial Robustness

arXiv:2009.08233v114 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of evaluating adversarial robustness for machine learning models, highlighting vulnerabilities in current methods, but it is incremental as it builds on existing label smoothing techniques.

The paper investigates the relationship between label smoothing and adversarial robustness, finding that training models with label smoothing can achieve up to 75% robust accuracy on CIFAR-10 under PGD attacks, but the robustness is incomplete and volatile.

Recent studies indicate that current adversarial attack methods are flawed and easy to fail when encountering some deliberately designed defense. Sometimes even a slight modification in the model details will invalidate the attack. We find that training model with label smoothing can easily achieve striking accuracy under most gradient-based attacks. For instance, the robust accuracy of a WideResNet model trained with label smoothing on CIFAR-10 achieves 75% at most under PGD attack. To understand the reason underlying the subtle robustness, we investigate the relationship between label smoothing and adversarial robustness. Through theoretical analysis about the characteristics of the network trained with label smoothing and experiment verification of its performance under various attacks. We demonstrate that the robustness produced by label smoothing is incomplete based on the fact that its defense effect is volatile, and it cannot defend attacks transferred from a naturally trained model. Our study enlightens the research community to rethink how to evaluate the model's robustness appropriately.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes