LGNov 25, 2021

Clustering Effect of (Linearized) Adversarial Robust Models

Yang Bai, Xin Yan, Yong Jiang, Shu-Tao Xia, Yisen Wang

arXiv:2111.12922v16.56 citationsh-index: 48Has Code

Originality Incremental advance

AI Analysis

This work provides a novel interpretation of adversarial robustness, potentially benefiting researchers in machine learning security and model interpretability, though it appears incremental in building on existing robustness studies.

The paper investigates the underlying mechanism of adversarial robustness by analyzing linear components of robust models, finding a hierarchical clustering effect in linearized sub-networks, and applies this understanding to tasks like domain adaptation and robustness boosting.

Adversarial robustness has received increasing attention along with the study of adversarial examples. So far, existing works show that robust models not only obtain robustness against various adversarial attacks but also boost the performance in some downstream tasks. However, the underlying mechanism of adversarial robustness is still not clear. In this paper, we interpret adversarial robustness from the perspective of linear components, and find that there exist some statistical properties for comprehensively robust models. Specifically, robust models show obvious hierarchical clustering effect on their linearized sub-networks, when removing or replacing all non-linear components (e.g., batch normalization, maximum pooling, or activation layers). Based on these observations, we propose a novel understanding of adversarial robustness and apply it on more tasks including domain adaption and robustness boosting. Experimental evaluations demonstrate the rationality and superiority of our proposed clustering strategy.

View on arXiv PDF Code

Similar