CLLGOct 24, 2023

Do Stochastic Parrots have Feelings Too? Improving Neural Detection of Synthetic Text via Emotion Recognition

arXiv:2310.15904v1131 citationsh-index: 2Has Code
Originality Incremental advance
AI Analysis

This addresses the urgent need for powerful synthetic text detection technologies, which is crucial for combating misinformation and ensuring content authenticity, though it is incremental as it builds on existing emotion recognition methods.

The paper tackles the problem of detecting synthetic text by hypothesizing that pretrained language models lack emotional drivers, leading to affective incoherence, and develops an emotionally-aware detector that shows improvements across various generators, models, datasets, and domains, with substantial gains over ChatGPT in identifying its own output.

Recent developments in generative AI have shone a spotlight on high-performance synthetic text generation technologies. The now wide availability and ease of use of such models highlights the urgent need to provide equally powerful technologies capable of identifying synthetic text. With this in mind, we draw inspiration from psychological studies which suggest that people can be driven by emotion and encode emotion in the text they compose. We hypothesize that pretrained language models (PLMs) have an affective deficit because they lack such an emotional driver when generating text and consequently may generate synthetic text which has affective incoherence i.e. lacking the kind of emotional coherence present in human-authored text. We subsequently develop an emotionally aware detector by fine-tuning a PLM on emotion. Experiment results indicate that our emotionally-aware detector achieves improvements across a range of synthetic text generators, various sized models, datasets, and domains. Finally, we compare our emotionally-aware synthetic text detector to ChatGPT in the task of identification of its own output and show substantial gains, reinforcing the potential of emotion as a signal to identify synthetic text. Code, models, and datasets are available at https: //github.com/alanagiasi/emoPLMsynth

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes