AIJan 8

Effects of personality steering on cooperative behavior in Large Language Model agents

arXiv:2601.05302v2h-index: 1
Originality Incremental advance
AI Analysis

This addresses the problem of controlling LLM agent behavior in strategic interactions for AI researchers, though it is incremental as it builds on existing personality studies.

The study investigated how assigning personality traits to LLM agents affects their cooperative behavior in repeated Prisoner's Dilemma games, finding that agreeableness promotes cooperation across models like GPT-3.5-turbo, GPT-4o, and GPT-5, but personality steering increases vulnerability to exploitation in earlier models.

Large language models (LLMs) are increasingly used as autonomous agents in strategic and social interactions. Although recent studies suggest that assigning personality traits to LLMs can influence their behavior, how personality steering affects cooperation under controlled conditions remains unclear. In this study, we examine the effects of personality steering on cooperative behavior in LLM agents using repeated Prisoner's Dilemma games. Based on the Big Five framework, we first measure basic personality scores of three models, GPT-3.5-turbo, GPT-4o, and GPT-5, using the Big Five Inventory. We then compare behavior under baseline and personality-informed conditions, and further analyze the effects of independently manipulating each personality dimension to extreme values. Our results show that agreeableness is the dominant factor promoting cooperation across all models, while other personality traits have limited impact. Explicit personality information increases cooperation but can also raise vulnerability to exploitation, particularly in earlier-generation models. In contrast, later-generation models exhibit more selective cooperation. These findings indicate that personality steering acts as a behavioral bias rather than a deterministic control mechanism.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes