Multiverse: Multilingual Evidence for Fake News Detection
This work addresses the limited language coverage of fake news detection for online platforms, though its contribution is incremental, since it builds on existing approaches.
The authors tackled fake news detection by proposing a multilingual evidence feature, which improved classification performance when combined with linguistic features on general-topic and COVID-19 news datasets.
Misleading information spreads on the Internet at an incredible speed, which in some cases can lead to irreparable consequences. Developing fake news detection technologies is therefore becoming essential. While substantial work has been done in this direction, one limitation of current approaches is that the models focus on a single language and do not use multilingual information. In this work, we propose Multiverse, a new feature based on multilingual evidence that can be used for fake news detection and can improve existing approaches. We first confirm the hypothesis that cross-lingual evidence is a useful feature for fake news detection through a manual experiment on a set of known true and fake news. We then compare our fake news classification system based on the proposed feature with several baselines on two multi-domain datasets of general-topic news and one fake COVID-19 news dataset, showing that in combination with linguistic features it yields significant improvements.