CVCLDec 16, 2021

Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal Misinformation

arXiv:2112.08594v2629 citations
Originality Synthesis-oriented
AI Analysis

This addresses misinformation detection for high-public-significance domains, but it is incremental as it builds on existing CLIP models with new data and minor enhancements.

The paper tackled the problem of detecting out-of-context media, such as mis-captioned images on Twitter, focusing on topics like Climate Change, COVID-19, and Military Vehicles, and achieved an 11% detection improvement in a high precision regime over a strong baseline.

Detecting out-of-context media, such as "mis-captioned" images on Twitter, is a relevant problem, especially in domains of high public significance. In this work we aim to develop defenses against such misinformation for the topics of Climate Change, COVID-19, and Military Vehicles. We first present a large-scale multimodal dataset with over 884k tweets relevant to these topics. Next, we propose a detection method, based on the state-of-the-art CLIP model, that leverages automatically generated hard image-text mismatches. While this approach works well on our automatically constructed out-of-context tweets, we aim to validate its usefulness on data representative of the real world. Thus, we test it on a set of human-generated fakes created by mimicking in-the-wild misinformation. We achieve an 11% detection improvement in a high precision regime over a strong baseline. Finally, we share insights about our best model design and analyze the challenges of this emerging threat.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes