AIAug 19, 2024

A Disguised Wolf Is More Harmful Than a Toothless Tiger: Adaptive Malicious Code Injection Backdoor Attack Leveraging User Behavior as Triggers

arXiv:2408.10334v12.31 citationsh-index: 6

Originality Highly original

AI Analysis

This addresses security vulnerabilities in code generation models for software developers, presenting a novel attack scenario that is incremental in building on existing robustness issues.

The paper tackles security risks in large language models for code generation by proposing a game-theoretic model for backdoor attacks that adaptively inject malicious code based on user skill levels, validated through experiments on leading models to highlight significant threats.

In recent years, large language models (LLMs) have made significant progress in the field of code generation. However, as more and more users rely on these models for software development, the security risks associated with code generation models have become increasingly significant. Studies have shown that traditional deep learning robustness issues also negatively impact the field of code generation. In this paper, we first present the game-theoretic model that focuses on security issues in code generation scenarios. This framework outlines possible scenarios and patterns where attackers could spread malicious code models to create security threats. We also pointed out for the first time that the attackers can use backdoor attacks to dynamically adjust the timing of malicious code injection, which will release varying degrees of malicious code depending on the skill level of the user. Through extensive experiments on leading code generation models, we validate our proposed game-theoretic model and highlight the significant threats that these new attack scenarios pose to the safe use of code models.

View on arXiv PDF

Similar