GTLGAug 15, 2024

Is Knowledge Power? On the (Im)possibility of Learning from Strategic Interactions

Harvard
arXiv:2408.08272v27 citationsh-index: 25
Originality Incremental advance
AI Analysis

This addresses a foundational problem in game theory and multi-agent learning, showing limitations in learning from strategic interactions, which is incremental but clarifies theoretical boundaries.

The paper investigates whether agents in strategic environments can learn to achieve their optimal outcomes through repeated interactions, despite uncertainty about preferences, and finds that if one player has perfect knowledge, the informational gap persists, preventing the uninformed player from achieving her Stackelberg value.

When learning in strategic environments, a key question is whether agents can overcome uncertainty about their preferences to achieve outcomes they could have achieved absent any uncertainty. Can they do this solely through interactions with each other? We focus this question on the ability of agents to attain the value of their Stackelberg optimal strategy and study the impact of information asymmetry. We study repeated interactions in fully strategic environments where players' actions are decided based on learning algorithms that take into account their observed histories and knowledge of the game. We study the pure Nash equilibria (PNE) of a meta-game where players choose these algorithms as their actions. We demonstrate that if one player has perfect knowledge about the game, then any initial informational gap persists. That is, while there is always a PNE in which the informed agent achieves her Stackelberg value, there is a game where no PNE of the meta-game allows the partially informed player to achieve her Stackelberg value. On the other hand, if both players start with some uncertainty about the game, the quality of information alone does not determine which agent can achieve her Stackelberg value. In this case, the concept of information asymmetry becomes nuanced and depends on the game's structure. Overall, our findings suggest that repeated strategic interactions alone cannot facilitate learning effectively enough to earn an uninformed player her Stackelberg value.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes