HCMay 18

Distorted Perspectives of LLM-Simulated Preferences: Can AI Mislead Design?

arXiv:2605.1831142.4
Predicted impact top 45% in HC · last 90 daysOriginality Incremental advance
AI Analysis

For designers using LLMs to inform user experience decisions, this reveals that current LLM simulations can misrepresent user preferences, highlighting a critical gap in AI-assisted design.

LLM-simulated design preferences significantly and systematically diverge from real user preferences across 29 real-world tests (n=2073), with synthetic justifications lacking depth and nuance.

Designers of digital solutions increasingly consult Large Language Models (LLMs) for their work. However, it remains unclear how this may affect the user experiences they produce and there are no established practices. We investigate how design preferences expressed by LLM-driven simulation methods align with those of real users. We present a study that aggregates real-world data and design stimuli from twenty-nine preference tests conducted in practice by users of the UXtweak online research platform (n = 2073). We perform holistic multimodal simulations where we manipulate LLM variables (model reasoning, sampling, persona type, and specificity) and assess their effects on algorithmic fidelity. Our results unveil significant and systematic discrepancies between peoples' real design preferences and LLM simulations that are consistent across manipulations. Synthetic justifications lack genuine depth, nuance and reasoning, which they substitute by patterns like focus on generic properties, specific elements, elaboration and overpraising. The unique attention directed by this research toward preferences within visual design stimuli highlights misrepresentation of perception and meaning by LLMs in a context that is intuitive yet critical for design teams. The external and ecological validity of our findings is high, given their replication across a multitude of real-world studies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes