LG MLFeb 10, 2020

Playing to Learn Better: Repeated Games for Adversarial Learning with Multiple Classifiers

Prithviraj Dasgupta, Joseph B. Collins, Michael McCarrick

arXiv:2002.03924v12.31 citations

Originality Incremental advance

AI Analysis

This work addresses adversarial evasion attacks in machine learning, offering a game-theoretic approach for robust classifier selection, though it is incremental as it builds on existing adversarial learning and game theory methods.

The paper tackles the problem of adversarial learning where a learner must correctly predict classes from both clean and adversarial queries, proposing a Repeated Bayesian Sequential Game that uses self-play to select classifiers balancing prediction accuracy and cost, and demonstrates that the learner can appropriately match classifiers to query types while managing costs.

We consider the problem of prediction by a machine learning algorithm, called learner, within an adversarial learning setting. The learner's task is to correctly predict the class of data passed to it as a query. However, along with queries containing clean data, the learner could also receive malicious or adversarial queries from an adversary. The objective of the adversary is to evade the learner's prediction mechanism by sending adversarial queries that result in erroneous class prediction by the learner, while the learner's objective is to reduce the incorrect prediction of these adversarial queries without degrading the prediction quality of clean queries. We propose a game theory-based technique called a Repeated Bayesian Sequential Game where the learner interacts repeatedly with a model of the adversary using self play to determine the distribution of adversarial versus clean queries. It then strategically selects a classifier from a set of pre-trained classifiers that balances the likelihood of correct prediction for the query along with reducing the costs to use the classifier. We have evaluated our proposed technique using clean and adversarial text data with deep neural network-based classifiers and shown that the learner can select an appropriate classifier that is commensurate with the query type (clean or adversarial) while remaining aware of the cost to use the classifier.

View on arXiv PDF

Similar