LG CRAug 22, 2023

FilterFL: Knowledge Filtering-based Data-Free Backdoor Defense for Federated Learning

Yanxin Yang, Ming Hu, Xiaofei Xie, Yue Cao, Pengyu Zhang, Yihao Huang, Mingsong Chen

arXiv:2308.11333v27.79 citationsh-index: 38

Originality Incremental advance

AI Analysis

This addresses a critical security problem in distributed machine learning for applications requiring privacy, though it is an incremental improvement on existing defense methods.

The paper tackles the vulnerability of Federated Learning to backdoor attacks by proposing a data-free defense method that filters poisoned models using generated trigger images, achieving defense against nearly all existing attack types and outperforming seven state-of-the-art methods, even with 80% malicious clients.

As a distributed machine learning paradigm, Federated Learning (FL) enables large-scale clients to collaboratively train a model without sharing their raw data. However, due to the lack of data auditing for untrusted clients, FL is vulnerable to poisoning attacks, especially backdoor attacks. By using poisoned data for local training or directly changing the model parameters, attackers can easily inject backdoors into the model, which can trigger the model to make misclassification of targeted patterns in images. To address these issues, we propose a novel data-free trigger-generation-based defense approach based on the two characteristics of backdoor attacks: i) triggers are learned faster than normal knowledge, and ii) trigger patterns have a greater effect on image classification than normal class patterns. Our approach generates the images with newly learned knowledge by identifying the differences between the old and new global models, and filters trigger images by evaluating the effect of these generated images. By using these trigger images, our approach eliminates poisoned models to ensure the updated global model is benign. Comprehensive experiments demonstrate that our approach can defend against almost all the existing types of backdoor attacks and outperform all the seven state-of-the-art defense methods with both IID and non-IID scenarios. Especially, our approach can successfully defend against the backdoor attack even when 80\% of the clients are malicious.

View on arXiv PDF

Similar