CLJul 2, 2024

Fake News Detection: It's All in the Data!

arXiv:2407.02122v215 citationsh-index: 31
Originality Synthesis-oriented
AI Analysis

It addresses the problem of dataset limitations for researchers in fake news detection, but is incremental as a survey and resource compilation.

This survey examines how dataset quality and diversity affect fake news detection models, providing a GitHub repository that consolidates publicly available datasets to facilitate research.

This comprehensive survey serves as an indispensable resource for researchers embarking on the journey of fake news detection. By highlighting the pivotal role of dataset quality and diversity, it underscores the significance of these elements in the effectiveness and robustness of detection models. The survey meticulously outlines the key features of datasets, various labeling systems employed, and prevalent biases that can impact model performance. Additionally, it addresses critical ethical issues and best practices, offering a thorough overview of the current state of available datasets. Our contribution to this field is further enriched by the provision of GitHub repository, which consolidates publicly accessible datasets into a single, user-friendly portal. This repository is designed to facilitate and stimulate further research and development efforts aimed at combating the pervasive issue of fake news.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes