CLMay 26, 2023

Playing repeated games with Large Language Models

arXiv:2305.16867v2260 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of understanding LLM social behavior for applications involving human-agent interactions, representing an incremental advance by applying behavioral game theory to machines.

The study investigated how Large Language Models (LLMs) behave in repeated games, finding they excel in self-interested scenarios like the iterated Prisoner's Dilemma but perform poorly in coordination games like the Battle of the Sexes, with GPT-4 showing improved scores and coordination when given opponent information or using a social chain-of-thought strategy.

LLMs are increasingly used in applications where they interact with humans and other agents. We propose to use behavioural game theory to study LLM's cooperation and coordination behaviour. We let different LLMs play finitely repeated $2\times2$ games with each other, with human-like strategies, and actual human players. Our results show that LLMs perform particularly well at self-interested games like the iterated Prisoner's Dilemma family. However, they behave sub-optimally in games that require coordination, like the Battle of the Sexes. We verify that these behavioural signatures are stable across robustness checks. We additionally show how GPT-4's behaviour can be modulated by providing additional information about its opponent and by using a "social chain-of-thought" (SCoT) strategy. This also leads to better scores and more successful coordination when interacting with human players. These results enrich our understanding of LLM's social behaviour and pave the way for a behavioural game theory for machines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes