LGGTSep 3, 2016

Randomized Prediction Games for Adversarial Machine Learning

arXiv:1609.00804v164 citations
Originality Incremental advance
AI Analysis

This work addresses security in spam and malware detection by introducing randomization to enhance classifier robustness against evasion attacks, representing an incremental advance over deterministic game-theoretic methods.

The paper tackles the problem of adversarial machine learning by proposing a randomized prediction game where both classifier and attacker use randomized strategies, improving the trade-off between attack detection and false alarms compared to state-of-the-art secure classifiers in applications like handwritten digit recognition, spam, and malware detection.

In spam and malware detection, attackers exploit randomization to obfuscate malicious data and increase their chances of evading detection at test time; e.g., malware code is typically obfuscated using random strings or byte sequences to hide known exploits. Interestingly, randomization has also been proposed to improve security of learning algorithms against evasion attacks, as it results in hiding information about the classifier to the attacker. Recent work has proposed game-theoretical formulations to learn secure classifiers, by simulating different evasion attacks and modifying the classification function accordingly. However, both the classification function and the simulated data manipulations have been modeled in a deterministic manner, without accounting for any form of randomization. In this work, we overcome this limitation by proposing a randomized prediction game, namely, a non-cooperative game-theoretic formulation in which the classifier and the attacker make randomized strategy selections according to some probability distribution defined over the respective strategy set. We show that our approach allows one to improve the trade-off between attack detection and false alarms with respect to state-of-the-art secure classifiers, even against attacks that are different from those hypothesized during design, on application examples including handwritten digit recognition, spam and malware detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes