CVAIMar 29, 2024

On Inherent Adversarial Robustness of Active Vision Systems

arXiv:2404.00185v22 citationsh-index: 9Trans. Mach. Learn. Res.
Originality Incremental advance
AI Analysis

This addresses the problem of adversarial robustness in computer vision systems, offering a novel approach inspired by human vision, though it is incremental as it builds on existing active vision methods.

The paper tackled the vulnerability of deep neural networks to adversarial examples by integrating active vision mechanisms, such as saccades and foveation, into deep learning systems, resulting in 2-3 times greater robustness compared to standard passive networks under adversarial attacks.

Current Deep Neural Networks are vulnerable to adversarial examples, which alter their predictions by adding carefully crafted noise. Since human eyes are robust to such inputs, it is possible that the vulnerability stems from the standard way of processing inputs in one shot by processing every pixel with the same importance. In contrast, neuroscience suggests that the human vision system can differentiate salient features by (1) switching between multiple fixation points (saccades) and (2) processing the surrounding with a non-uniform external resolution (foveation). In this work, we advocate that the integration of such active vision mechanisms into current deep learning systems can offer robustness benefits. Specifically, we empirically demonstrate the inherent robustness of two active vision methods - GFNet and FALcon - under a black box threat model. By learning and inferencing based on downsampled glimpses obtained from multiple distinct fixation points within an input, we show that these active methods achieve (2-3) times greater robustness compared to a standard passive convolutional network under state-of-the-art adversarial attacks. More importantly, we provide illustrative and interpretable visualization analysis that demonstrates how performing inference from distinct fixation points makes active vision methods less vulnerable to malicious inputs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes