CL CY LG

Sep 28, 2020

Identifying Automatically Generated Headlines using Transformers

Antonis Maronikolakis, Hinrich Schutze, Mark Stevenson

arXiv:2009.13375v3 27.8728 citations

Originality: Synthesis-oriented

AI Analysis

The paper addresses the spread of fake content online, which influences public opinion. Its contribution is incremental, however, because it applies existing methods to a new dataset.

The paper tackles the problem of identifying automatically generated headlines to combat misinformation. Its best automatic approach, transformers, reaches an overall accuracy of 85.7%, whereas participants in a user study identified the fake headlines in only 47.8% of cases.

False information spread via the internet and social media influences public opinion and user activity, while generative models enable fake content to be generated faster and more cheaply than had previously been possible. In the not so distant future, identifying fake content generated by deep learning models will play a key role in protecting users from misinformation. To this end, a dataset containing human and computer-generated headlines was created and a user study indicated that humans were only able to identify the fake headlines in 47.8% of the cases. However, the most accurate automatic approach, transformers, achieved an overall accuracy of 85.7%, indicating that content generated from language models can be filtered out accurately.
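As a rough illustration of the "overall accuracy" figure quoted above, the sketch below scores a headline detector against labelled examples. Everything in it is an assumption made for illustration: the label convention (1 = computer-generated, 0 = human-written), the function names, the toy headlines, and the rule-based detector, which merely stands in for a trained transformer classifier and is not the method used in the paper.

```python
from typing import Callable, Sequence, Tuple

# Assumed label convention for this sketch: 1 = computer-generated, 0 = human-written.
Example = Tuple[str, int]


def overall_accuracy(examples: Sequence[Example], predict: Callable[[str], int]) -> float:
    """Return the fraction of headlines whose predicted label matches the gold label."""
    if not examples:
        raise ValueError("need at least one labelled headline")
    correct = sum(1 for text, label in examples if predict(text) == label)
    return correct / len(examples)


# Hypothetical toy data, invented for illustration only (not from the paper's dataset).
toy_examples = [
    ("Council approves new budget for road repairs", 0),
    ("Local team wins regional final after extra time", 0),
    ("Minister says says economy to grow grow next year", 1),
    ("Storm storm warning issued for coastal towns", 1),
]


def repeated_word_detector(headline: str) -> int:
    """Placeholder rule: flag a headline if any word is immediately repeated."""
    words = headline.lower().split()
    return int(any(a == b for a, b in zip(words, words[1:])))


toy_accuracy = overall_accuracy(toy_examples, repeated_word_detector)
```

Any classifier can be scored the same way by passing a different `predict` callable, for instance a wrapper around a fine-tuned transformer or a function that replays human annotators' answers.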
