CRAILGOct 16, 2023

Demystifying Poisoning Backdoor Attacks from a Statistical Perspective

arXiv:2310.10780v212 citationsh-index: 13
Originality Incremental advance
AI Analysis

It addresses security risks for ML systems by providing a statistical framework to analyze backdoor attacks, which is incremental as it builds on existing attack scenarios.

This paper tackles the problem of understanding and evaluating the effectiveness of poisoning backdoor attacks in machine learning by establishing tight lower and upper bounds for model performance on clean and backdoor test data, with experiments on benchmark datasets and state-of-the-art attack scenarios.

The growing dependence on machine learning in real-world applications emphasizes the importance of understanding and ensuring its safety. Backdoor attacks pose a significant security risk due to their stealthy nature and potentially serious consequences. Such attacks involve embedding triggers within a learning model with the intention of causing malicious behavior when an active trigger is present while maintaining regular functionality without it. This paper evaluates the effectiveness of any backdoor attack incorporating a constant trigger, by establishing tight lower and upper boundaries for the performance of the compromised model on both clean and backdoor test data. The developed theory answers a series of fundamental but previously underexplored problems, including (1) what are the determining factors for a backdoor attack's success, (2) what is the direction of the most effective backdoor attack, and (3) when will a human-imperceptible trigger succeed. Our derived understanding applies to both discriminative and generative models. We also demonstrate the theory by conducting experiments using benchmark datasets and state-of-the-art backdoor attack scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes