CR AIAug 11, 2023

CyberForce: A Federated Reinforcement Learning Framework for Malware Mitigation

Chao Feng, Alberto Huertas Celdran, Pedro Miguel Sanchez Sanchez, Jan Kreischer, Jan von der Assen, Gerome Bovet, Gregorio Martinez Perez, Burkhard Stiller

arXiv:2308.05978v45.77 citationsh-index: 37

Originality Incremental advance

AI Analysis

This work addresses cybersecurity for IoT devices by improving privacy and speed in malware mitigation, though it is incremental as it builds on existing RL and federated learning methods.

The authors tackled the problem of slow and privacy-invasive centralized reinforcement learning for moving target defense against zero-day malware in IoT devices by proposing CyberForce, a federated reinforcement learning framework that learns effective defenses faster than centralized approaches and benefits from knowledge transfer across devices.

Recent research has shown that the integration of Reinforcement Learning (RL) with Moving Target Defense (MTD) can enhance cybersecurity in Internet-of-Things (IoT) devices. Nevertheless, the practicality of existing work is hindered by data privacy concerns associated with centralized data processing in RL, and the unsatisfactory time needed to learn right MTD techniques that are effective against a rising number of heterogeneous zero-day attacks. Thus, this work presents CyberForce, a framework that combines Federated and Reinforcement Learning (FRL) to collaboratively and privately learn suitable MTD techniques for mitigating zero-day attacks. CyberForce integrates device fingerprinting and anomaly detection to reward or penalize MTD mechanisms chosen by an FRL-based agent. The framework has been deployed and evaluated in a scenario consisting of ten physical devices of a real IoT platform affected by heterogeneous malware samples. A pool of experiments has demonstrated that CyberForce learns the MTD technique mitigating each attack faster than existing RL-based centralized approaches. In addition, when various devices are exposed to different attacks, CyberForce benefits from knowledge transfer, leading to enhanced performance and reduced learning time in comparison to recent works. Finally, different aggregation algorithms used during the agent learning process provide CyberForce with notable robustness to malicious attacks.

View on arXiv PDF

Similar