LG CVDec 15, 2023

Fragility, Robustness and Antifragility in Deep Learning

Chandresh Pravin, Ivan Martino, Giuseppe Nicosia, Varun Ojha

arXiv:2312.09821v28.86 citationsh-index: 8Artif Intell

Originality Incremental advance

AI Analysis

This work addresses model robustness against adversarial attacks in deep learning, offering an incremental method for enhancing performance.

The authors tackled the problem of analyzing deep neural networks for fragility, robustness, and antifragility by proposing synaptic filtering to identify parameter characteristics, and they showed that selectively retraining robust and antifragile parameters improves test accuracy on adversarial datasets, with specific improvements for ResNet and ShuffleNet models.

We propose a systematic analysis of deep neural networks (DNNs) based on a signal processing technique for network parameter removal, in the form of synaptic filters that identifies the fragility, robustness and antifragility characteristics of DNN parameters. Our proposed analysis investigates if the DNN performance is impacted negatively, invariantly, or positively on both clean and adversarially perturbed test datasets when the DNN undergoes synaptic filtering. We define three \textit{filtering scores} for quantifying the fragility, robustness and antifragility characteristics of DNN parameters based on the performances for (i) clean dataset, (ii) adversarial dataset, and (iii) the difference in performances of clean and adversarial datasets. We validate the proposed systematic analysis on ResNet-18, ResNet-50, SqueezeNet-v1.1 and ShuffleNet V2 x1.0 network architectures for MNIST, CIFAR10 and Tiny ImageNet datasets. The filtering scores, for a given network architecture, identify network parameters that are invariant in characteristics across different datasets over learning epochs. Vice-versa, for a given dataset, the filtering scores identify the parameters that are invariant in characteristics across different network architectures. We show that our synaptic filtering method improves the test accuracy of ResNet and ShuffleNet models on adversarial datasets when only the robust and antifragile parameters are selectively retrained at any given epoch, thus demonstrating applications of the proposed strategy in improving model robustness.

View on arXiv PDF

Similar