LG CV · Oct 1, 2023

Towards Understanding Adversarial Transferability in Federated Learning

arXiv:2310.00616v25.33 citations · h-index: 8

Originality: Incremental advance

AI Analysis

This paper addresses security risks in federated learning for privacy-sensitive applications, though the advance is incremental because it builds on known adversarial attack methods.

The paper investigates covert adversarial attacks in federated learning where malicious clients initially act benign during training, then use their data to craft transferable attacks, achieving over 80% attack success with only 3% of client data. Surprisingly, federated learning shows higher robustness than centralized systems due to decentralized training and model averaging.

We investigate a specific security risk in FL: a group of malicious clients has impacted the model during training by disguising their identities and acting as benign clients but later switching to an adversarial role. They use their data, which was part of the training set, to train a substitute model and conduct transferable adversarial attacks against the federated model. This type of attack is subtle and hard to detect because these clients initially appear to be benign. The key question we address is: How robust is the FL system to such covert attacks, especially compared to traditional centralized learning systems? We empirically show that the proposed attack poses a high security risk to current FL systems. By using only 3% of the client's data, we achieve the highest attack rate of over 80%. To further offer a full understanding of the challenges the FL system faces in transferable attacks, we provide a comprehensive analysis of the transfer robustness of FL across a spectrum of configurations. Surprisingly, FL systems show a higher level of robustness than their centralized counterparts, especially when both systems are equally good at handling regular, non-malicious data. We attribute this increased robustness to two main factors: 1) Decentralized Data Training: Each client trains the model on its own data, reducing the overall impact of any single malicious client. 2) Model Update Averaging: The updates from each client are averaged together, further diluting any malicious alterations. Both practical experiments and theoretical analysis support our conclusions. This research not only sheds light on the resilience of FL systems against hidden attacks but also raises important considerations for their future application and development.
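The dilution effect of update averaging mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the function name `fedavg`, the equal client weights, and the toy update values are all assumptions made for illustration (federated averaging as usually described weights each client by its number of samples). The sketch only shows the arithmetic: with n equally weighted clients, a single client's deviation from the rest is scaled by 1/n in the aggregate.

```python
from typing import List


def fedavg(updates: List[List[float]]) -> List[float]:
    """Coordinate-wise mean of client updates.

    Assumption: every client has equal weight. A sample-count-weighted
    average would replace the plain mean used here.
    """
    if not updates:
        raise ValueError("need at least one client update")
    dim = len(updates[0])
    if any(len(u) != dim for u in updates):
        raise ValueError("all client updates must have the same length")
    n_clients = len(updates)
    return [sum(u[i] for u in updates) / n_clients for i in range(dim)]


# Hypothetical toy values: nine clients send the same update, and one
# client sends an update that deviates by 10 in each coordinate.
benign = [[1.0, -1.0] for _ in range(9)]
outlier = [[11.0, -11.0]]

aggregated = fedavg(benign + outlier)

# Shift of the aggregate away from the benign update: 10 / 10 clients = 1.
shift = [a - b for a, b in zip(aggregated, benign[0])]

print("aggregated update:", aggregated)
print("shift caused by the outlier:", shift)
```

In this toy setting the outlier moves each coordinate of the aggregate by one tenth of its own deviation. The abstract's claim concerns trained models rather than hand-picked vectors, so the sketch should be read only as intuition for the averaging step.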
