LG CRMar 20, 2022

Adversarial Parameter Attack on Deep Neural Networks

arXiv:2203.10502v16.911 citationsh-index: 8

Originality Highly original

AI Analysis

This addresses security vulnerabilities in DNNs for applications like autonomous systems, though it is incremental as it builds on prior parameter perturbation attacks.

The paper tackles the problem of adversarial parameter attacks on deep neural networks, where small perturbations to model parameters significantly reduce robustness without affecting accuracy, and proves the existence of such attacks with theoretical conditions and effective training algorithms.

In this paper, a new parameter perturbation attack on DNNs, called adversarial parameter attack, is proposed, in which small perturbations to the parameters of the DNN are made such that the accuracy of the attacked DNN does not decrease much, but its robustness becomes much lower. The adversarial parameter attack is stronger than previous parameter perturbation attacks in that the attack is more difficult to be recognized by users and the attacked DNN gives a wrong label for any modified sample input with high probability. The existence of adversarial parameters is proved. For a DNN $F_Θ$ with the parameter set $Θ$ satisfying certain conditions, it is shown that if the depth of the DNN is sufficiently large, then there exists an adversarial parameter set $Θ_a$ for $Θ$ such that the accuracy of $F_{Θ_a}$ is equal to that of $F_Θ$, but the robustness measure of $F_{Θ_a}$ is smaller than any given bound. An effective training algorithm is given to compute adversarial parameters and numerical experiments are used to demonstrate that the algorithms are effective to produce high quality adversarial parameters.

View on arXiv PDF

Similar