LG | Feb 23, 2021

Non-Singular Adversarial Robustness of Neural Networks

Yu-Lin Tsai, Chia-Yi Hsu, Chia-Mu Yu, Pin-Yu Chen

arXiv:2102.11935v13.15 citations

Originality: Incremental advance

AI Analysis

This work addresses a gap in robustness assessment for neural networks: it moves beyond input-only (singular) perturbations to evaluate joint perturbations of inputs and weights. The advance is incremental but relevant to security applications.

The paper shows that neural networks trained to be robust to input perturbations remain vulnerable to weight perturbations. It formalizes non-singular robustness against joint input-weight attacks and proposes a regularization-based training method that improves robustness to such joint perturbations.

Adversarial robustness has become an emerging challenge for neural networks owing to their over-sensitivity to small input perturbations. While this issue is critical, we argue that solving it alone fails to provide a comprehensive robustness assessment. Even worse, the conclusions drawn from singular robustness may give a false sense of overall model robustness. Specifically, our findings show that adversarially trained models that are robust to input perturbations are still (or even more) vulnerable to weight perturbations when compared to standard models. In this paper, we formalize the notion of non-singular adversarial robustness for neural networks through the lens of joint perturbations to data inputs as well as model weights. To the best of our knowledge, this study is the first to consider simultaneous input-weight adversarial perturbations. Based on a multi-layer feed-forward neural network model with ReLU activation functions and standard classification loss, we establish error analysis for quantifying the loss sensitivity subject to $\ell_\infty$-norm bounded perturbations on data inputs and model weights. Based on the error analysis, we propose novel regularization functions for robust training and demonstrate improved non-singular robustness against joint input-weight adversarial perturbations.
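
To make the idea of a joint input-weight perturbation concrete, the following is a minimal sketch and not the paper's method. It assumes a bias-free two-layer ReLU network with cross-entropy loss and applies a single sign-gradient step (in the style of FGSM) to the input and to both weight matrices, so every perturbation stays inside an $\ell_\infty$ ball. The attack choice, the function names (`loss_and_grads`, `joint_sign_step`), and the budgets `eps_x` and `eps_w` are illustrative assumptions; the abstract does not specify an attack algorithm.

```python
import math

def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * a for w, a in zip(row, v)) for row in W]

def loss_and_grads(W1, W2, x, y):
    """Cross-entropy loss of a bias-free 2-layer ReLU net (an assumed toy model),
    with hand-derived gradients w.r.t. the input x and the weights W1, W2."""
    z = matvec(W1, x)
    h = [max(0.0, a) for a in z]                      # ReLU
    logits = matvec(W2, h)
    m = max(logits)                                   # shift for a stable softmax
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    loss = -math.log(probs[y])
    # Backward pass: d(loss)/d(logits) = probs - onehot(y)
    d_logits = [p - (1.0 if k == y else 0.0) for k, p in enumerate(probs)]
    g_W2 = [[d * a for a in h] for d in d_logits]
    d_h = [sum(W2[k][j] * d_logits[k] for k in range(len(W2)))
           for j in range(len(h))]
    d_z = [d if a > 0 else 0.0 for d, a in zip(d_h, z)]  # ReLU gate
    g_W1 = [[d * a for a in x] for d in d_z]
    g_x = [sum(W1[j][i] * d_z[j] for j in range(len(W1)))
           for i in range(len(x))]
    return loss, g_x, g_W1, g_W2

def sign(a):
    return (a > 0) - (a < 0)

def joint_sign_step(W1, W2, x, y, eps_x, eps_w):
    """One sign-gradient step on input AND weights (hypothetical attack).
    Each coordinate moves by at most eps_x (input) or eps_w (weights)."""
    _, g_x, g_W1, g_W2 = loss_and_grads(W1, W2, x, y)
    x_adv = [a + eps_x * sign(g) for a, g in zip(x, g_x)]

    def perturb(W, G):
        return [[w + eps_w * sign(g) for w, g in zip(rw, rg)]
                for rw, rg in zip(W, G)]

    return x_adv, perturb(W1, g_W1), perturb(W2, g_W2)

# Toy values chosen only for illustration.
W1 = [[0.5, -0.3], [0.2, 0.8]]
W2 = [[1.0, -1.0], [-0.5, 0.5]]
x, y = [1.0, 2.0], 0

clean_loss = loss_and_grads(W1, W2, x, y)[0]
x_adv, W1_adv, W2_adv = joint_sign_step(W1, W2, x, y, eps_x=0.05, eps_w=0.05)
adv_loss = loss_and_grads(W1_adv, W2_adv, x_adv, y)[0]
print("clean loss:", clean_loss, "loss under joint perturbation:", adv_loss)
```

Setting `eps_w=0` recovers an input-only (singular) perturbation, and setting `eps_x=0` gives a weight-only one, which makes it easy to compare the three cases on the same toy model.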
