CLJun 11, 2025

Comparing human and LLM politeness strategies in free production

arXiv:2506.09391v28 citationsh-index: 1EMNLP
Originality Incremental advance
AI Analysis

This addresses the alignment challenge of polite speech in AI systems for human-AI interaction, though it is incremental in highlighting subtle differences.

The study investigated whether large language models (LLMs) use context-sensitive politeness strategies like humans, finding that models with ≥70B parameters replicate human preferences and are preferred by evaluators in open-ended tasks, but disproportionately rely on negative politeness strategies even in positive contexts.

Polite speech poses a fundamental alignment challenge for large language models (LLMs). Humans deploy a rich repertoire of linguistic strategies to balance informational and social goals -- from positive approaches that build rapport (compliments, expressions of interest) to negative strategies that minimize imposition (hedging, indirectness). We investigate whether LLMs employ a similarly context-sensitive repertoire by comparing human and LLM responses in both constrained and open-ended production tasks. We find that larger models ($\ge$70B parameters) successfully replicate key preferences from the computational pragmatics literature, and human evaluators surprisingly prefer LLM-generated responses in open-ended contexts. However, further linguistic analyses reveal that models disproportionately rely on negative politeness strategies even in positive contexts, potentially leading to misinterpretations. While modern LLMs demonstrate an impressive handle on politeness strategies, these subtle differences raise important questions about pragmatic alignment in AI systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes