CLAIHCJun 24, 2025

Spotting Out-of-Character Behavior: Atomic-Level Evaluation of Persona Fidelity in Open-Ended Generation

arXiv:2506.19352v17 citationsh-index: 7Has CodeACL
Originality Incremental advance
AI Analysis

This addresses the issue of persona fidelity for developers and users of LLMs in open-ended generation, though it is incremental as it builds on existing evaluation methods.

The paper tackles the problem of Out-of-Character behavior in large language models by proposing an atomic-level evaluation framework with three metrics to quantify persona fidelity at a finer granularity, demonstrating that it effectively detects persona inconsistencies overlooked by prior methods.

Ensuring persona fidelity in large language models (LLMs) is essential for maintaining coherent and engaging human-AI interactions. However, LLMs often exhibit Out-of-Character (OOC) behavior, where generated responses deviate from an assigned persona, leading to inconsistencies that affect model reliability. Existing evaluation methods typically assign single scores to entire responses, struggling to capture subtle persona misalignment, particularly in long-form text generation. To address this limitation, we propose an atomic-level evaluation framework that quantifies persona fidelity at a finer granularity. Our three key metrics measure the degree of persona alignment and consistency within and across generations. Our approach enables a more precise and realistic assessment of persona fidelity by identifying subtle deviations that real users would encounter. Through our experiments, we demonstrate that our framework effectively detects persona inconsistencies that prior methods overlook. By analyzing persona fidelity across diverse tasks and personality types, we reveal how task structure and persona desirability influence model adaptability, highlighting challenges in maintaining consistent persona expression.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes