IR CLOct 12, 2024

Yesterday's News: Benchmarking Multi-Dimensional Out-of-Distribution Generalization of Misinformation Detection Models

Ivo Verhoeven, Pushkar Mishra, Ekaterina Shutova

Meta AI

arXiv:2410.18122v34.02 citationsh-index: 37Has CodeComputational Linguistics

Originality Synthesis-oriented

AI Analysis

This addresses the challenge for moderators and researchers in detecting rapidly evolving misinformation, though it is incremental as it focuses on benchmarking rather than proposing new methods.

The authors tackled the problem of misinformation detection models' inability to generalize out-of-distribution by introducing misinfo-general, a benchmark dataset that simulates covariate shifts across axes like time and topic, showing that baseline models fail to meet desired generalization criteria.

This article introduces misinfo-general, a benchmark dataset for evaluating misinformation models' ability to perform out-of-distribution generalization. Misinformation changes rapidly, much more quickly than moderators can annotate at scale, resulting in a shift between the training and inference data distributions. As a result, misinformation detectors need to be able to perform out-of-distribution generalization, an attribute they currently lack. Our benchmark uses distant labelling to enable simulating covariate shifts in misinformation content. We identify time, event, topic, publisher, political bias, misinformation type as important axes for generalization, and we evaluate a common class of baseline models on each. Using article metadata, we show how this model fails desiderata, which is not necessarily obvious from classification metrics. Finally, we analyze properties of the data to ensure limited presence of modelling shortcuts. We make the dataset and accompanying code publicly available: https://github.com/ioverho/misinfo-general

View on arXiv PDF Code

Similar