LG CRSep 20, 2024

Persistent Backdoor Attacks in Continual Learning

arXiv:2409.13864v311.513 citationsh-index: 16

Originality Incremental advance

AI Analysis

This addresses a security threat for continual learning systems, particularly in critical applications, but is incremental as it builds on existing backdoor attack research.

The paper tackles the problem of backdoor attacks in continual learning by introducing two persistent attacks, Blind Task Backdoor and Latent Task Backdoor, which achieve high success rates across different algorithms and evade state-of-the-art defenses.

Backdoor attacks pose a significant threat to neural networks, enabling adversaries to manipulate model outputs on specific inputs, often with devastating consequences, especially in critical applications. While backdoor attacks have been studied in various contexts, little attention has been given to their practicality and persistence in continual learning, particularly in understanding how the continual updates to model parameters, as new data distributions are learned and integrated, impact the effectiveness of these attacks over time. To address this gap, we introduce two persistent backdoor attacks-Blind Task Backdoor and Latent Task Backdoor-each leveraging minimal adversarial influence. Our blind task backdoor subtly alters the loss computation without direct control over the training process, while the latent task backdoor influences only a single task's training, with all other tasks trained benignly. We evaluate these attacks under various configurations, demonstrating their efficacy with static, dynamic, physical, and semantic triggers. Our results show that both attacks consistently achieve high success rates across different continual learning algorithms, while effectively evading state-of-the-art defenses, such as SentiNet and I-BAU.

View on arXiv PDF

Similar