CLJan 13, 2025

Investigating Large Language Models in Inferring Personality Traits from User Conversations

arXiv:2501.07532v16 citationsh-index: 4
Originality Synthesis-oriented
AI Analysis

It addresses the problem of automating psychological assessment for researchers and clinicians, but it is incremental as it applies existing LLMs to a new task with structured prompting.

This study evaluated whether GPT-4o and GPT-4o mini could infer Big Five personality traits from user conversations under zero-shot prompting, finding that prompting for BFI-10 item scores first enhanced accuracy and that the models showed differential performance in groups with depressive symptoms.

Large Language Models (LLMs) are demonstrating remarkable human like capabilities across diverse domains, including psychological assessment. This study evaluates whether LLMs, specifically GPT-4o and GPT-4o mini, can infer Big Five personality traits and generate Big Five Inventory-10 (BFI-10) item scores from user conversations under zero-shot prompting conditions. Our findings reveal that incorporating an intermediate step--prompting for BFI-10 item scores before calculating traits--enhances accuracy and aligns more closely with the gold standard than direct trait inference. This structured approach underscores the importance of leveraging psychological frameworks in improving predictive precision. Additionally, a group comparison based on depressive symptom presence revealed differential model performance. Participants were categorized into two groups: those experiencing at least one depressive symptom and those without symptoms. GPT-4o mini demonstrated heightened sensitivity to depression-related shifts in traits such as Neuroticism and Conscientiousness within the symptom-present group, whereas GPT-4o exhibited strengths in nuanced interpretation across groups. These findings underscore the potential of LLMs to analyze real-world psychological data effectively, offering a valuable foundation for interdisciplinary research at the intersection of artificial intelligence and psychology.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes