LG AINov 7, 2024

Defending Deep Regression Models against Backdoor Attacks

Lingyu Du, Yupei Liu, Jinyuan Jia, Guohao Lan

arXiv:2411.04811v17.93 citationsh-index: 9

Originality Highly original

AI Analysis

This addresses a critical security vulnerability in safety-critical applications using deep regression models, representing a novel domain-specific advancement rather than an incremental improvement.

The paper tackles the problem of defending deep regression models against backdoor attacks, which are undefended by existing methods due to continuous outputs and feature-space triggers, and proposes DRMGuard as the first defense, showing it consistently outperforms adapted state-of-the-art defenses in evaluations on two tasks and four datasets.

Deep regression models are used in a wide variety of safety-critical applications, but are vulnerable to backdoor attacks. Although many defenses have been proposed for classification models, they are ineffective as they do not consider the uniqueness of regression models. First, the outputs of regression models are continuous values instead of discretized labels. Thus, the potential infected target of a backdoored regression model has infinite possibilities, which makes it impossible to be determined by existing defenses. Second, the backdoor behavior of backdoored deep regression models is triggered by the activation values of all the neurons in the feature space, which makes it difficult to be detected and mitigated using existing defenses. To resolve these problems, we propose DRMGuard, the first defense to identify if a deep regression model in the image domain is backdoored or not. DRMGuard formulates the optimization problem for reverse engineering based on the unique output-space and feature-space characteristics of backdoored deep regression models. We conduct extensive evaluations on two regression tasks and four datasets. The results show that DRMGuard can consistently defend against various backdoor attacks. We also generalize four state-of-the-art defenses designed for classifiers to regression models, and compare DRMGuard with them. The results show that DRMGuard significantly outperforms all those defenses.

View on arXiv PDF

Similar