LG CRMar 5, 2023

Local Environment Poisoning Attacks on Federated Reinforcement Learning

Evelyn Ma, Praneet Rathi, S. Rasoul Etesami

arXiv:2303.02725v47.78 citationsh-index: 3

Originality Incremental advance

AI Analysis

This work highlights a security problem in FRL systems, posing challenges for robust algorithm design, but it is incremental as it builds on known poisoning concerns in federated learning.

The authors tackled the vulnerability of Federated Reinforcement Learning (FRL) to poisoning attacks by malicious agents, proposing a framework that reduces performance across various environments more effectively than baseline methods.

Federated learning (FL) has become a popular tool for solving traditional Reinforcement Learning (RL) tasks. The multi-agent structure addresses the major concern of data-hungry in traditional RL, while the federated mechanism protects the data privacy of individual agents. However, the federated mechanism also exposes the system to poisoning by malicious agents that can mislead the trained policy. Despite the advantage brought by FL, the vulnerability of Federated Reinforcement Learning (FRL) has not been well-studied before. In this work, we propose a general framework to characterize FRL poisoning as an optimization problem and design a poisoning protocol that can be applied to policy-based FRL. Our framework can also be extended to FRL with actor-critic as a local RL algorithm by training a pair of private and public critics. We provably show that our method can strictly hurt the global objective. We verify our poisoning effectiveness by conducting extensive experiments targeting mainstream RL algorithms and over various RL OpenAI Gym environments covering a wide range of difficulty levels. Within these experiments, we compare clean and baseline poisoning methods against our proposed framework. The results show that the proposed framework is successful in poisoning FRL systems and reducing performance across various environments and does so more effectively than baseline methods. Our work provides new insights into the vulnerability of FL in RL training and poses new challenges for designing robust FRL algorithms

View on arXiv PDF

Similar