LG MLFeb 10, 2020

Adversarial Attacks on Linear Contextual Bandits

Evrard Garcelon, Baptiste Roziere, Laurent Meunier, Jean Tarbouriech, Olivier Teytaud, Alessandro Lazaric, Matteo Pirotta

arXiv:2002.03839v319.559 citations

Originality Highly original

AI Analysis

This addresses security vulnerabilities in bandit algorithms used in domains like advertising and recommender systems, where malicious actors could manipulate outcomes, representing a novel investigation into adversarial threats in this setting.

The paper tackles the problem of adversarial attacks on linear contextual bandit algorithms, showing that a malicious agent can force the algorithm to pull a desired arm nearly all times over a horizon T, with adversarial modifications growing only logarithmically as O(log T).

Contextual bandit algorithms are applied in a wide range of domains, from advertising to recommender systems, from clinical trials to education. In many of these domains, malicious agents may have incentives to attack the bandit algorithm to induce it to perform a desired behavior. For instance, an unscrupulous ad publisher may try to increase their own revenue at the expense of the advertisers; a seller may want to increase the exposure of their products, or thwart a competitor's advertising campaign. In this paper, we study several attack scenarios and show that a malicious agent can force a linear contextual bandit algorithm to pull any desired arm $T - o(T)$ times over a horizon of $T$ steps, while applying adversarial modifications to either rewards or contexts that only grow logarithmically as $O(\log T)$. We also investigate the case when a malicious agent is interested in affecting the behavior of the bandit algorithm in a single context (e.g., a specific user). We first provide sufficient conditions for the feasibility of the attack and we then propose an efficient algorithm to perform the attack. We validate our theoretical results on experiments performed on both synthetic and real-world datasets.

View on arXiv PDF

Similar