AICYOct 20, 2024

Exploring Social Desirability Response Bias in Large Language Models: Evidence from GPT-4 Simulations

arXiv:2410.15442v16 citationsh-index: 6
Originality Synthesis-oriented
AI Analysis

This addresses potential biases in LLM-based social simulations, which is important for researchers using these models, but it is incremental as it builds on existing bias studies.

The study investigated whether GPT-4 exhibits social desirability response bias in social survey simulations, finding mixed results: a commitment statement increased bias scores but reduced civic engagement scores, with limited impact on predictive performance.

Large language models (LLMs) are employed to simulate human-like responses in social surveys, yet it remains unclear if they develop biases like social desirability response (SDR) bias. To investigate this, GPT-4 was assigned personas from four societies, using data from the 2022 Gallup World Poll. These synthetic samples were then prompted with or without a commitment statement intended to induce SDR. The results were mixed. While the commitment statement increased SDR index scores, suggesting SDR bias, it reduced civic engagement scores, indicating an opposite trend. Additional findings revealed demographic associations with SDR scores and showed that the commitment statement had limited impact on GPT-4's predictive performance. The study underscores potential avenues for using LLMs to investigate biases in both humans and LLMs themselves.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes