Is He Extroverted? Identifying Missing Relevant Personas for Faithful User Simulation
This addresses the issue of ensuring faithful user simulation in dialogue systems, which is incremental as it builds on existing approaches by focusing on missing personas.
The paper tackles the problem of identifying missing persona dimensions in user simulation for dialogue, which can lead to ambiguous user choices, by introducing the PICQ-drama benchmark and evaluating leading LLMs on it, showing feasibility and revealing cognitive differences between LLMs and humans.
Existing user simulation approaches focus on generating user-like responses in dialogue. They often assume that the provided persona is sufficient for producing such responses, without verifying whether critical personas are supplied. This raises concerns about the validity of simulation results. To address this issue, we study the task of identifying persona dimensions (e.g., "whether the user is price-sensitive") that are relevant but missing in simulating a user's reply for a given dialogue context. We introduce PICQ-drama (constructed from TVShowGuess), a benchmark of context-aware choice questions, annotated with missing persona dimensions whose absence leads to ambiguous user choices. We further design diverse evaluation criteria for missing persona identification. Benchmarking leading LLMs on our PICQ-drama dataset demonstrates the feasibility of this task. Evaluation across diverse criteria, along with further analyses, reveals cognitive differences between LLMs and humans and highlights the distinct roles of different persona categories in shaping responses.