GTAIJun 15, 2024

Large Language Models Playing Mixed Strategy Nash Equilibrium Games

arXiv:2406.10574v213 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of applying LLMs to game theory for researchers and practitioners, but it is incremental as it builds on existing knowledge of LLM limitations in complex scenarios.

The paper investigates the ability of large language models (LLMs) to find Nash equilibria in mixed-strategy games, showing that performance improves significantly when LLMs can run code with specific prompts, but declines with slight game modifications or complex randomization strategies.

Generative artificial intelligence (Generative AI), and in particular Large Language Models (LLMs) have gained significant popularity among researchers and industrial communities, paving the way for integrating LLMs in different domains, such as robotics, telecom, and healthcare. In this paper, we study the intersection of game theory and generative artificial intelligence, focusing on the capabilities of LLMs to find the Nash equilibrium in games with a mixed strategy Nash equilibrium and no pure strategy Nash equilibrium (that we denote mixed strategy Nash equilibrium games). The study reveals a significant enhancement in the performance of LLMs when they are equipped with the possibility to run code and are provided with a specific prompt to incentivize them to do so. However, our research also highlights the limitations of LLMs when the randomization strategy of the game is not easy to deduce. It is evident that while LLMs exhibit remarkable proficiency in well-known standard games, their performance dwindles when faced with slight modifications of the same games. This paper aims to contribute to the growing body of knowledge on the intersection of game theory and generative artificial intelligence while providing valuable insights into LLMs strengths and weaknesses. It also underscores the need for further research to overcome the limitations of LLMs, particularly in dealing with even slightly more complex scenarios, to harness their full potential.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes