CVCRLGJul 24, 2023

Latent Code Augmentation Based on Stable Diffusion for Data-free Substitute Attacks

arXiv:2307.12872v25 citationsh-index: 103Has Code
Originality Highly original
AI Analysis

This work addresses the challenge of efficient and accurate substitute model training in black-box adversarial attacks, which is incremental as it improves upon existing GANs-based approaches by leveraging diffusion models.

The paper tackles the problem of low training efficiency and generation quality in data-free black-box substitute attacks by proposing a novel scheme based on Stable Diffusion with Latent Code Augmentation (LCA) to generate high-quality, diverse data aligned with the target model's distribution, achieving higher attack success rates and requiring fewer query budgets compared to GANs-based methods.

Since the training data of the target model is not available in the black-box substitute attack, most recent schemes utilize GANs to generate data for training the substitute model. However, these GANs-based schemes suffer from low training efficiency as the generator needs to be retrained for each target model during the substitute training process, as well as low generation quality. To overcome these limitations, we consider utilizing the diffusion model to generate data, and propose a novel data-free substitute attack scheme based on the Stable Diffusion (SD) to improve the efficiency and accuracy of substitute training. Despite the data generated by the SD exhibiting high quality, it presents a different distribution of domains and a large variation of positive and negative samples for the target model. For this problem, we propose Latent Code Augmentation (LCA) to facilitate SD in generating data that aligns with the data distribution of the target model. Specifically, we augment the latent codes of the inferred member data with LCA and use them as guidance for SD. With the guidance of LCA, the data generated by the SD not only meets the discriminative criteria of the target model but also exhibits high diversity. By utilizing this data, it is possible to train the substitute model that closely resembles the target model more efficiently. Extensive experiments demonstrate that our LCA achieves higher attack success rates and requires fewer query budgets compared to GANs-based schemes for different target models. Our codes are available at \url{https://github.com/LzhMeng/LCA}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes