Probing Cultural Awareness in LLMs: A Case Study of Cross-Culture Aesthetic Stylistics
For researchers and developers deploying LLMs in culturally diverse contexts, this work reveals a gap in LLMs' cultural awareness, specifically in aesthetic stylistics, which is incremental.
LLMs show limited sensitivity to Hong Kong-specific stylistic structure, relying on surface-level linguistic cues rather than deeper stylistic patterns in cross-cultural aesthetic stylistics tasks.
Large Language Models (LLMs) are increasingly deployed in diverse cultural contexts, yet their ability to master aesthetic stylistics, i.e., the strategic use of language to evoke cultural resonance, remains underexplored. We curate C4STYLI, a benchmark of highly stylized translated movie titles and advertising slogans from Hong Kong and the Chinese Mainland, to evaluate LLMs via the lens of behavioral recognition and productive competence. Extensive evaluations show that LLMs differ from humans in stylistic recognition, and this recognition ability varies across text domains. In addition, stylistic recognition and generation performance in LLMs are not consistently aligned. To further examine whether LLMs genuinely capture stylistic information in stylistic recognition, we conduct structural ablation with logistic regression probes. We find that, in the Hong Kong setting, stylistic recognition in LLMs relies primarily on surface-level linguistic information rather than stylistic structure. This suggests limited sensitivity to Hong Kong-specific stylistic structure.