CLAug 5, 2024

LLM economicus? Mapping the Behavioral Biases of LLMs via Utility Theory

arXiv:2408.02784v145 citationsh-index: 7
Originality Incremental advance
AI Analysis

This work addresses the problem of understanding and mitigating economic biases in LLMs for researchers and practitioners deploying them in decision-making support, though it is incremental in applying existing economic theory to AI evaluation.

The study investigated whether large language models (LLMs) exhibit human-like behavioral biases such as loss aversion and anchoring, using utility theory to quantify their economic behavior. It found that current LLMs are neither fully human-like nor perfectly rational, and they often fail to maintain consistent behavior across settings.

Humans are not homo economicus (i.e., rational economic beings). As humans, we exhibit systematic behavioral biases such as loss aversion, anchoring, framing, etc., which lead us to make suboptimal economic decisions. Insofar as such biases may be embedded in text data on which large language models (LLMs) are trained, to what extent are LLMs prone to the same behavioral biases? Understanding these biases in LLMs is crucial for deploying LLMs to support human decision-making. We propose utility theory-a paradigm at the core of modern economic theory-as an approach to evaluate the economic biases of LLMs. Utility theory enables the quantification and comparison of economic behavior against benchmarks such as perfect rationality or human behavior. To demonstrate our approach, we quantify and compare the economic behavior of a variety of open- and closed-source LLMs. We find that the economic behavior of current LLMs is neither entirely human-like nor entirely economicus-like. We also find that most current LLMs struggle to maintain consistent economic behavior across settings. Finally, we illustrate how our approach can measure the effect of interventions such as prompting on economic biases.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes