CLApr 25, 2025

Can Third-parties Read Our Emotions?

arXiv:2504.18673v15 citationsh-index: 12ACL
Originality Incremental advance
AI Analysis

This work addresses a critical gap in NLP for emotion recognition by highlighting annotation biases, which is incremental but important for improving model reliability in applications like mental health or social media analysis.

The study investigated whether third-party annotators can accurately capture authors' private emotions from text, finding significant limitations in both human and LLM annotations, with LLMs outperforming humans but still falling short of first-party labels.

Natural Language Processing tasks that aim to infer an author's private states, e.g., emotions and opinions, from their written text, typically rely on datasets annotated by third-party annotators. However, the assumption that third-party annotators can accurately capture authors' private states remains largely unexamined. In this study, we present human subjects experiments on emotion recognition tasks that directly compare third-party annotations with first-party (author-provided) emotion labels. Our findings reveal significant limitations in third-party annotations-whether provided by human annotators or large language models (LLMs)-in faithfully representing authors' private states. However, LLMs outperform human annotators nearly across the board. We further explore methods to improve third-party annotation quality. We find that demographic similarity between first-party authors and third-party human annotators enhances annotation performance. While incorporating first-party demographic information into prompts leads to a marginal but statistically significant improvement in LLMs' performance. We introduce a framework for evaluating the limitations of third-party annotations and call for refined annotation practices to accurately represent and model authors' private states.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes