CLMar 26

An Experimental Comparison of the Most Popular Approaches to Fake News Detection

Pietro Dell'Oglio, Alessandro Bondielli, Francesco Marcelloni, Lucia C. Passaro

arXiv:2603.2550176.5h-index: 5

Predicted impact top 79% in CL · last 90 daysOriginality Synthesis-oriented

AI Analysis

This work addresses the challenge of robust fake news detection for researchers and practitioners, but it is incremental as it focuses on benchmarking existing methods rather than introducing new ones.

The paper tackled the problem of fake news detection by experimentally comparing 12 popular approaches across 10 diverse datasets, finding that fine-tuned models perform well in-domain but struggle to generalize, while cross-domain architectures and LLMs offer improvements but with limitations.

In recent years, fake news detection has received increasing attention in public debate and scientific research. Despite advances in detection techniques, the production and spread of false information have become more sophisticated, driven by Large Language Models (LLMs) and the amplification power of social media. We present a critical assessment of 12 representative fake news detection approaches, spanning traditional machine learning, deep learning, transformers, and specialized cross-domain architectures. We evaluate these methods on 10 publicly available datasets differing in genre, source, topic, and labeling rationale. We address text-only English fake news detection as a binary classification task by harmonizing labels into "Real" and "Fake" to ensure a consistent evaluation protocol. We acknowledge that label semantics vary across datasets and that harmonization inevitably removes such semantic nuances. Each dataset is treated as a distinct domain. We conduct in-domain, multi-domain and cross-domain experiments to simulate real-world scenarios involving domain shift and out-of-distribution data. Fine-tuned models perform well in-domain but struggle to generalize. Cross-domain architectures can reduce this gap but are data-hungry, while LLMs offer a promising alternative through zero- and few-shot learning. Given inherent dataset confounds and possible pre-training exposure, results should be interpreted as robustness evaluations within this English, text-only protocol.

View on arXiv PDF

Similar