LGCRNIMLSep 17, 2020

Finding Effective Security Strategies through Reinforcement Learning and Self-Play

arXiv:2009.08120v270 citations
AI Analysis

This work addresses network security for intrusion prevention, showing incremental application of self-play from other domains to this context.

The paper tackled the problem of automatically finding effective security strategies for intrusion prevention by modeling attacker-defender interactions as a Markov game and evolving strategies through reinforcement learning and self-play. The result demonstrated that effective strategies emerged, outperforming two baseline methods, though policy convergence remained a challenge.

We present a method to automatically find security strategies for the use case of intrusion prevention. Following this method, we model the interaction between an attacker and a defender as a Markov game and let attack and defense strategies evolve through reinforcement learning and self-play without human intervention. Using a simple infrastructure configuration, we demonstrate that effective security strategies can emerge from self-play. This shows that self-play, which has been applied in other domains with great success, can be effective in the context of network security. Inspection of the converged policies show that the emerged policies reflect common-sense knowledge and are similar to strategies of humans. Moreover, we address known challenges of reinforcement learning in this domain and present an approach that uses function approximation, an opponent pool, and an autoregressive policy representation. Through evaluations we show that our method is superior to two baseline methods but that policy convergence in self-play remains a challenge.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes