LG CV · Apr 7, 2022

Adaptive-Gravity: A Defense Against Adversarial Samples

Ali Mirzaeian, Zhi Tian, Sai Manoj P D, Banafsheh S. Latibari, Ioannis Savidis, Houman Homayoun, Avesta Sasan

arXiv:2204.03694v13.33 citations · h-index: 33

Originality: Highly original

AI Analysis

This paper addresses the vulnerability of AI models to adversarial attacks and proposes a novel defense mechanism for robust classification.

The paper tackles the problem of adversarial examples in deep neural networks by introducing Adaptive-Gravity, a training method that increases class separation and reduces feature spread, resulting in reduced fooling rates against attacks like FGSM, MIM, BIM, and PGD on MNIST and CIFAR10 datasets while also improving training accuracy.
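To make "fooling rate" concrete, the sketch below applies FGSM, the simplest of the listed attacks, to a toy binary logistic-regression classifier in NumPy. The classifier, the weights `w` and `b`, the synthetic data, and the definition of fooling rate used here (the fraction of originally correct predictions that the attack flips) are all assumptions chosen for illustration; they are not taken from the paper, which evaluates LeNet and ResNet110.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fgsm(x, y, w, b, eps):
    """FGSM: move each input by eps along the sign of the loss gradient."""
    p = sigmoid(x @ w + b)
    # For binary cross-entropy with a linear logit, dLoss/dx = (p - y) * w.
    grad_x = (p - y)[:, None] * w[None, :]
    return x + eps * np.sign(grad_x)

def fooling_rate(x, x_adv, y, w, b):
    """Fraction of originally correct predictions that the attack flips (assumed definition)."""
    pred = (x @ w + b > 0).astype(int)
    pred_adv = (x_adv @ w + b > 0).astype(int)
    correct = pred == y
    return float(np.mean(pred_adv[correct] != y[correct]))

# Hypothetical toy setup: a fixed linear classifier that labels its own data,
# so every clean sample starts out correctly classified.
rng = np.random.default_rng(0)
w = np.array([1.0, -1.0])
b = 0.0
x = rng.normal(size=(200, 2))
y = (x @ w + b > 0).astype(int)

# Fooling rate for a range of perturbation budgets.
rates = [fooling_rate(x, fgsm(x, y, w, b, eps), y, w, b) for eps in (0.0, 0.1, 0.5, 10.0)]
print(rates)
```

In this toy model a sample is fooled only once the budget `eps` exceeds its margin to the decision boundary, which is why a training method that enlarges class separation would be expected to lower the fooling rate at a fixed `eps`.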

This paper presents a novel model training solution, denoted as Adaptive-Gravity, for enhancing the robustness of deep neural network classifiers against adversarial examples. We conceptualize the model parameters/features associated with each class as a mass characterized by its centroid location and the spread (standard deviation of the distance) of features around the centroid. We use the centroid associated with each cluster to derive an anti-gravity force that pushes the centroids of different classes away from one another during network training. We then customize an objective function that concentrates each class's features around their corresponding new centroid, obtained by applying the anti-gravity force. This methodology results in a larger separation between different masses and reduces the spread of features around each centroid. As a result, the samples are pushed away from the space that adversarial examples could be mapped to, effectively increasing the degree of perturbation needed to craft an adversarial example. We have implemented this training solution as an iterative method consisting of four steps at each iteration: 1) centroid extraction, 2) anti-gravity force calculation, 3) centroid relocation, and 4) gravity training. Gravity's effectiveness is evaluated by measuring the corresponding fooling rates against various attack models, including FGSM, MIM, BIM, and PGD, using LeNet and ResNet110 networks, benchmarked against MNIST and CIFAR10 classification problems. Test results show that Gravity not only functions as a powerful instrument to robustify a model against state-of-the-art adversarial attacks but also effectively improves the model training accuracy.
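The four steps named in the abstract can be sketched in NumPy on fixed 2-D features. This is only an illustration under stated assumptions: the inverse-square repulsion law, the `step` size, the squared-distance loss, and all function names are hypothetical choices, since the abstract does not give the paper's actual force or objective. In real training the features would come from a network and step 4 would be minimized by gradient descent.

```python
import numpy as np

def extract_centroids(features, labels, num_classes):
    """Step 1: per-class centroid and spread (std of distances to the centroid)."""
    centroids = np.stack([features[labels == c].mean(axis=0) for c in range(num_classes)])
    dists = np.linalg.norm(features - centroids[labels], axis=1)
    spreads = np.array([dists[labels == c].std() for c in range(num_classes)])
    return centroids, spreads

def anti_gravity_forces(centroids, eps=1e-8):
    """Step 2: net repulsive force on each centroid (assumed inverse-square law)."""
    diff = centroids[:, None, :] - centroids[None, :, :]  # diff[i, j] = c_i - c_j
    dist = np.linalg.norm(diff, axis=-1)
    np.fill_diagonal(dist, np.inf)  # a centroid exerts no force on itself
    # Unit direction divided by squared distance equals diff / dist**3.
    return (diff / (dist[..., None] ** 3 + eps)).sum(axis=1)

def relocate_centroids(centroids, forces, step=0.5):
    """Step 3: move each centroid along its net anti-gravity force."""
    return centroids + step * forces

def gravity_loss(features, labels, targets):
    """Step 4: mean squared distance from each feature to its relocated centroid."""
    return float(np.mean(np.sum((features - targets[labels]) ** 2, axis=1)))

# Hypothetical toy data: three noisy clusters standing in for class features.
rng = np.random.default_rng(0)
labels = np.repeat(np.arange(3), 50)
true_centers = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
features = true_centers[labels] + 0.3 * rng.normal(size=(150, 2))

centroids, spreads = extract_centroids(features, labels, 3)
forces = anti_gravity_forces(centroids)
targets = relocate_centroids(centroids, forces)
loss = gravity_loss(features, labels, targets)
print(centroids, spreads, targets, loss, sep="\n")
```

Minimizing the step-4 loss with respect to the network that produces `features` would pull each class toward its relocated centroid, which is the mechanism the abstract credits for larger class separation and smaller spread.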
