CLJun 11, 2025

Comparing human and LLM politeness strategies in free production

arXiv:2506.09391v213.98 citationsh-index: 1EMNLP

Originality Incremental advance

AI Analysis

This addresses the alignment challenge of polite speech in AI systems for human-AI interaction, though it is incremental in highlighting subtle differences.

The study investigated whether large language models (LLMs) use context-sensitive politeness strategies like humans, finding that models with ≥70B parameters replicate human preferences and are preferred by evaluators in open-ended tasks, but disproportionately rely on negative politeness strategies even in positive contexts.

Polite speech poses a fundamental alignment challenge for large language models (LLMs). Humans deploy a rich repertoire of linguistic strategies to balance informational and social goals -- from positive approaches that build rapport (compliments, expressions of interest) to negative strategies that minimize imposition (hedging, indirectness). We investigate whether LLMs employ a similarly context-sensitive repertoire by comparing human and LLM responses in both constrained and open-ended production tasks. We find that larger models ($\ge$70B parameters) successfully replicate key preferences from the computational pragmatics literature, and human evaluators surprisingly prefer LLM-generated responses in open-ended contexts. However, further linguistic analyses reveal that models disproportionately rely on negative politeness strategies even in positive contexts, potentially leading to misinterpretations. While modern LLMs demonstrate an impressive handle on politeness strategies, these subtle differences raise important questions about pragmatic alignment in AI systems.

View on arXiv PDF

Similar