CRAISEJan 18, 2021

DeepPayload: Black-box Backdoor Attack on Deep Learning Models through Neural Payload Injection

arXiv:2101.06896v196 citations
Originality Highly original
AI Analysis

This addresses a security vulnerability for developers and users of mobile deep learning apps, revealing that 54 real-world apps are vulnerable, making it a novel and impactful finding rather than incremental.

The paper tackles the problem of backdoor attacks on deep learning models in mobile applications by introducing a black-box attack method called DeepPayload, which injects a neural payload to achieve a 93.5% success rate with minimal latency and accuracy impact.

Deep learning models are increasingly used in mobile applications as critical components. Unlike the program bytecode whose vulnerabilities and threats have been widely-discussed, whether and how the deep learning models deployed in the applications can be compromised are not well-understood since neural networks are usually viewed as a black box. In this paper, we introduce a highly practical backdoor attack achieved with a set of reverse-engineering techniques over compiled deep learning models. The core of the attack is a neural conditional branch constructed with a trigger detector and several operators and injected into the victim model as a malicious payload. The attack is effective as the conditional logic can be flexibly customized by the attacker, and scalable as it does not require any prior knowledge from the original model. We evaluated the attack effectiveness using 5 state-of-the-art deep learning models and real-world samples collected from 30 users. The results demonstrated that the injected backdoor can be triggered with a success rate of 93.5%, while only brought less than 2ms latency overhead and no more than 1.4% accuracy decrease. We further conducted an empirical study on real-world mobile deep learning apps collected from Google Play. We found 54 apps that were vulnerable to our attack, including popular and security-critical ones. The results call for the awareness of deep learning application developers and auditors to enhance the protection of deployed models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes