AI · GT · Mar 4, 2025

Playing games with Large language models: Randomness and strategy

arXiv:2503.02582v1 · 7 citations · h-index: 4
Originality: Incremental advance
AI Analysis

This work examines how well LLMs handle strategic decision-making, a question relevant to researchers and developers. It is an incremental advance that builds on existing critiques of LLM randomness.

The study investigated whether large language models (LLMs) can play games such as Rock Paper Scissors and the Prisoner's Dilemma. It found that their attempts at randomness are biased and that they appear to develop loss-aversion strategies in repeated play: Rock Paper Scissors converges to stalemates, while the Prisoner's Dilemma shifts between cooperative and competitive outcomes depending on prompt design.

Playing games has a long history of describing intricate interactions in simplified forms. In this paper we explore whether large language models (LLMs) can play games, investigating their capabilities for randomisation and strategic adaptation through both simultaneous and sequential game interactions. We focus on GPT-4o-Mini-2024-08-17 and test two games between LLMs: Rock Paper Scissors (RPS) and a game of strategy, the Prisoner's Dilemma (PD). LLMs are often described as stochastic parrots, and while they may indeed be parrots, our results suggest that they are not very stochastic: when prompted to be random, their outputs are often heavily biased. Our research reveals that LLMs appear to develop loss-aversion strategies in repeated games, with RPS converging to stalemate conditions while PD shows systematic shifts between cooperative and competitive outcomes based on prompt design. We detail programmatic tools for independent agent interactions and the agentic AI challenges faced in implementation. We show that LLMs can indeed play games, just not very well. These results have implications for the use of LLMs in multi-agent systems and showcase limitations in current approaches to model output for strategic decision-making.
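
One way to make the "biased randomness" claim concrete is a goodness-of-fit check on a sequence of RPS moves. The sketch below is an illustration only, not the authors' tooling: the function names, the 5% threshold, and the sample data are all assumptions, and the move counts are made up rather than taken from the paper. It computes a Pearson chi-square statistic against a uniform distribution over the three moves.

```python
from collections import Counter

MOVES = ("rock", "paper", "scissors")

# Chi-square critical value for 2 degrees of freedom at the 5% level.
CRITICAL_5PCT_DF2 = 5.991


def chi_square_uniform(moves):
    """Pearson chi-square statistic of `moves` against a uniform split over MOVES."""
    counts = Counter(moves)
    expected = len(moves) / len(MOVES)
    return sum((counts[m] - expected) ** 2 / expected for m in MOVES)


def looks_biased(moves):
    """True if uniformity is rejected at the 5% level (hypothetical helper)."""
    return chi_square_uniform(moves) > CRITICAL_5PCT_DF2


# Made-up sample: 100 moves heavily skewed towards "rock".
sample = ["rock"] * 60 + ["paper"] * 25 + ["scissors"] * 15
stat = chi_square_uniform(sample)
print(f"chi-square = {stat:.2f}, biased = {looks_biased(sample)}")
```

A statistic far above the critical value, as in this invented sample, would indicate that the move distribution is unlikely to be uniform. The test only looks at marginal frequencies, so it would not detect sequential patterns such as the stalemate convergence described above.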

