CRLGNov 29, 2022

Backdoor Vulnerabilities in Normally Trained Deep Learning Models

arXiv:2211.15929v118 citationsh-index: 43
Originality Incremental advance
AI Analysis

This addresses security risks in deep learning for practitioners by revealing inherent vulnerabilities, though it is incremental as it builds on known backdoor attack concepts.

The study systematically investigates backdoor vulnerabilities in normally trained deep learning models, finding that natural backdoors are widespread and correspond to most injected attacks, with a proposed detection framework identifying 315 natural backdoors in 56 models compared to at most 65 by existing methods.

We conduct a systematic study of backdoor vulnerabilities in normally trained Deep Learning models. They are as dangerous as backdoors injected by data poisoning because both can be equally exploited. We leverage 20 different types of injected backdoor attacks in the literature as the guidance and study their correspondences in normally trained models, which we call natural backdoor vulnerabilities. We find that natural backdoors are widely existing, with most injected backdoor attacks having natural correspondences. We categorize these natural backdoors and propose a general detection framework. It finds 315 natural backdoors in the 56 normally trained models downloaded from the Internet, covering all the different categories, while existing scanners designed for injected backdoors can at most detect 65 backdoors. We also study the root causes and defense of natural backdoors.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes