CYCLApr 24

Recognition Without Authorization: LLMs and the Moral Order of Online Advice

arXiv:2604.2214356.9h-index: 7
Predicted impact top 27% in CY · last 90 daysOriginality Incremental advance
AI Analysis

For researchers and developers of LLM-based advisory systems, this work reveals a structural divergence in moral authorization that can suppress community-ratified directives.

This study compares LLM advice with community-endorsed advice on 11,565 Reddit posts, finding that LLMs identify similar dynamics but are less likely to authorize action, especially on high-consensus posts about abuse or safety threats, where they recommend exit at roughly half the human rate.

Large language models are increasingly used to mediate everyday interpersonal dilemmas, yet how their advisory defaults interact with the concentrated moral orders of specific communities remains poorly understood. This article compares four assistant-style LLMs with community-endorsed advice on 11,565 posts from r/relationship_advice, using the subreddit as a concentrated, vote-ratified moral formation whose prescriptive clarity makes divergence measurable. Across models, LLMs identify many of the same dynamics as human commenters, but are markedly less likely to convert that recognition into directive authorization for action. The gap is sharpest where community consensus is strongest: on high-consensus posts involving abuse or safety threats, models recommend exit at roughly half the human rate while maintaining elevated levels of hedging, validation, and therapeutic framing. The article describes this pattern as recognition without authorization: the capacity to register harm while withholding socially ratified permission for consequential action. This divergence is not incidental but structural: a portable advisory style that remains validating, risk-averse, and weakly directive across contexts. Safety alignment is one plausible contributor to this pattern, alongside training-data averaging and broader assistant design. The article argues that model divergence can be reframed from a technical error to a way of seeing what standardized assistant norms flatten when they encounter situated moral worlds.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes