GTAIOCMar 13, 2024

Strategizing against Q-learners: A Control-theoretical Approach

arXiv:2403.08906v36 citationsh-index: 2IEEE Control Systems Letters
Originality Incremental advance
AI Analysis

This work addresses the vulnerability of widely used multi-agent reinforcement learning methods to strategic manipulation, which is an incremental but important problem for AI security and game theory.

The paper investigates how sophisticated opponents can exploit independent Q-learning algorithms in repeated normal-form games by modeling the interactions as a stochastic game and using a quantization-based approximation scheme to analyze the exploitation potential.

In this paper, we explore the susceptibility of the independent Q-learning algorithms (a classical and widely used multi-agent reinforcement learning method) to strategic manipulation of sophisticated opponents in normal-form games played repeatedly. We quantify how much strategically sophisticated agents can exploit naive Q-learners if they know the opponents' Q-learning algorithm. To this end, we formulate the strategic actors' interactions as a stochastic game (whose state encompasses Q-function estimates of the Q-learners) as if the Q-learning algorithms are the underlying dynamical system. We also present a quantization-based approximation scheme to tackle the continuum state space and analyze its performance for two competing strategic actors and a single strategic actor both analytically and numerically.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes