CYAIJul 2, 2025

Penalizing Transparency? How AI Disclosure and Author Demographics Shape Human and AI Judgments About Writing

arXiv:2507.01418v19 citationsh-index: 57
Originality Incremental advance
AI Analysis

It addresses the problem of asymmetric burdens in AI transparency for marginalized groups, with implications for fairness in hiring and content algorithms, though it is incremental in exploring interactions between disclosure and identity.

This study investigated how AI disclosure statements affect perceptions of writing quality and whether these effects vary by author race and gender, finding that both human and LLM raters penalize disclosed AI use, but only LLMs show demographic biases where advantages for women or Black authors disappear when AI assistance is revealed.

As AI integrates in various types of human writing, calls for transparency around AI assistance are growing. However, if transparency operates on uneven ground and certain identity groups bear a heavier cost for being honest, then the burden of openness becomes asymmetrical. This study investigates how AI disclosure statement affects perceptions of writing quality, and whether these effects vary by the author's race and gender. Through a large-scale controlled experiment, both human raters (n = 1,970) and LLM raters (n = 2,520) evaluated a single human-written news article while disclosure statements and author demographics were systematically varied. This approach reflects how both human and algorithmic decisions now influence access to opportunities (e.g., hiring, promotion) and social recognition (e.g., content recommendation algorithms). We find that both human and LLM raters consistently penalize disclosed AI use. However, only LLM raters exhibit demographic interaction effects: they favor articles attributed to women or Black authors when no disclosure is present. But these advantages disappear when AI assistance is revealed. These findings illuminate the complex relationships between AI disclosure and author identity, highlighting disparities between machine and human evaluation patterns.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes