CVSep 7, 2022

On the Transferability of Adversarial Examples between Encrypted Models

arXiv:2209.02997v14 citationsh-index: 35
Originality Incremental advance
AI Analysis

This work addresses security concerns for machine learning practitioners by showing incremental improvements in adversarial defense through encryption.

The paper tackled the problem of adversarial transferability between encrypted models designed for defense, finding that using encrypted models not only improves robustness against adversarial examples but also reduces their transferability, with results evaluated using the AutoAttack benchmark.

Deep neural networks (DNNs) are well known to be vulnerable to adversarial examples (AEs). In addition, AEs have adversarial transferability, namely, AEs generated for a source model fool other (target) models. In this paper, we investigate the transferability of models encrypted for adversarially robust defense for the first time. To objectively verify the property of transferability, the robustness of models is evaluated by using a benchmark attack method, called AutoAttack. In an image-classification experiment, the use of encrypted models is confirmed not only to be robust against AEs but to also reduce the influence of AEs in terms of the transferability of models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes