CR AINov 13, 2023

An Extensive Study on Adversarial Attack against Pre-trained Models of Code

Xiaohu Du, Ming Wen, Zichao Wei, Shangwen Wang, Hai Jin

arXiv:2311.07553v213.126 citationsh-index: 79Has Code

Originality Incremental advance

AI Analysis

It addresses security vulnerabilities in code intelligence systems, which is critical for mission-critical applications, but is incremental as it builds on prior attack methods.

This study systematically analyzes adversarial attacks on pre-trained models of code, finding that existing approaches fail to balance effectiveness, efficiency, and naturalness, and proposes a new method that outperforms the state-of-the-art ALERT by prioritizing identifier substitutions in specific statements like for and if.

Transformer-based pre-trained models of code (PTMC) have been widely utilized and have achieved state-of-the-art performance in many mission-critical applications. However, they can be vulnerable to adversarial attacks through identifier substitution or coding style transformation, which can significantly degrade accuracy and may further incur security concerns. Although several approaches have been proposed to generate adversarial examples for PTMC, the effectiveness and efficiency of such approaches, especially on different code intelligence tasks, has not been well understood. To bridge this gap, this study systematically analyzes five state-of-the-art adversarial attack approaches from three perspectives: effectiveness, efficiency, and the quality of generated examples. The results show that none of the five approaches balances all these perspectives. Particularly, approaches with a high attack success rate tend to be time-consuming; the adversarial code they generate often lack naturalness, and vice versa. To address this limitation, we explore the impact of perturbing identifiers under different contexts and find that identifier substitution within for and if statements is the most effective. Based on these findings, we propose a new approach that prioritizes different types of statements for various tasks and further utilizes beam search to generate adversarial examples. Evaluation results show that it outperforms the state-of-the-art ALERT in terms of both effectiveness and efficiency while preserving the naturalness of the generated adversarial examples.

View on arXiv PDF Code

Similar