CL LGMay 1, 2022

Detecting COVID-19 Conspiracy Theories with Transformers and TF-IDF

Haoming Guo, Tianyi Huang, Huixuan Huang, Mingyue Fan, Gerald Friedland

arXiv:2205.00377v10.83 citationsh-index: 30

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of detecting domain-specific fake news for social media platforms, but it is incremental as it applies existing methods to new data.

The paper tackled the detection of COVID-19 conspiracy theories by applying machine learning models to the MediaEval 2021 benchmark, finding that a pre-trained transformer achieved the best validation results, with a randomly initialized transformer reaching close accuracies.

The sharing of fake news and conspiracy theories on social media has wide-spread negative effects. By designing and applying different machine learning models, researchers have made progress in detecting fake news from text. However, existing research places a heavy emphasis on general, common-sense fake news, while in reality fake news often involves rapidly changing topics and domain-specific vocabulary. In this paper, we present our methods and results for three fake news detection tasks at MediaEval benchmark 2021 that specifically involve COVID-19 related topics. We experiment with a group of text-based models including Support Vector Machines, Random Forest, BERT, and RoBERTa. We find that a pre-trained transformer yields the best validation results, but a randomly initialized transformer with smart design can also be trained to reach accuracies close to that of the pre-trained transformer.

View on arXiv PDF

Similar