LGMay 6, 2024

Decentralized Online Learning in General-Sum Stackelberg Games

arXiv:2405.03158v1UAI
Originality Incremental advance
AI Analysis

This work addresses strategic decision-making in multi-agent systems for researchers in game theory and online learning, offering incremental advances with new strategies and theoretical guarantees.

The paper tackles decentralized online learning in general-sum Stackelberg games by analyzing limited and side information settings, showing that myopic best response is optimal for the follower in limited information but not in side information, where strategic manipulation can improve outcomes, and it provides last-iterate convergence and sample complexity results with empirical validation.

We study an online learning problem in general-sum Stackelberg games, where players act in a decentralized and strategic manner. We study two settings depending on the type of information for the follower: (1) the limited information setting where the follower only observes its own reward, and (2) the side information setting where the follower has extra side information about the leader's reward. We show that for the follower, myopically best responding to the leader's action is the best strategy for the limited information setting, but not necessarily so for the side information setting -- the follower can manipulate the leader's reward signals with strategic actions, and hence induce the leader's strategy to converge to an equilibrium that is better off for itself. Based on these insights, we study decentralized online learning for both players in the two settings. Our main contribution is to derive last-iterate convergence and sample complexity results in both settings. Notably, we design a new manipulation strategy for the follower in the latter setting, and show that it has an intrinsic advantage against the best response strategy. Our theories are also supported by empirical results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes