CVNov 18, 2020

Contextual Fusion For Adversarial Robustness

Aiswarya Akumalla, Seth Haney, Maksim Bazhenov

arXiv:2011.09526v12.33 citations

Originality Incremental advance

AI Analysis

This work offers an incremental improvement in adversarial robustness for deep neural networks, particularly for image classification tasks, by introducing a biologically inspired fusion mechanism.

The paper addresses the vulnerability of deep neural networks to adversarial perturbations by developing a fusion model that combines background and foreground features from Places-CNN and Imagenet-CNN. This approach significantly improves classification robustness against gradient-based attacks on CIFAR-10 and MS COCO, without degrading performance on clean data or requiring adversarial retraining. It also shows improvements for Gaussian blur perturbations.

Mammalian brains handle complex reasoning tasks in a gestalt manner by integrating information from regions of the brain that are specialised to individual sensory modalities. This allows for improved robustness and better generalisation ability. In contrast, deep neural networks are usually designed to process one particular information stream and susceptible to various types of adversarial perturbations. While many methods exist for detecting and defending against adversarial attacks, they do not generalise across a range of attacks and negatively affect performance on clean, unperturbed data. We developed a fusion model using a combination of background and foreground features extracted in parallel from Places-CNN and Imagenet-CNN. We tested the benefits of the fusion approach on preserving adversarial robustness for human perceivable (e.g., Gaussian blur) and network perceivable (e.g., gradient-based) attacks for CIFAR-10 and MS COCO data sets. For gradient based attacks, our results show that fusion allows for significant improvements in classification without decreasing performance on unperturbed data and without need to perform adversarial retraining. Our fused model revealed improvements for Gaussian blur type perturbations as well. The increase in performance from fusion approach depended on the variability of the image contexts; larger increases were seen for classes of images with larger differences in their contexts. We also demonstrate the effect of regularization to bias the classifier decision in the presence of a known adversary. We propose that this biologically inspired approach to integrate information across multiple modalities provides a new way to improve adversarial robustness that can be complementary to current state of the art approaches.

View on arXiv PDF

Similar