LGJan 21

Multimodal Rumor Detection Enhanced by External Evidence and Forgery Features

arXiv:2601.14954v1h-index: 1
Originality Incremental advance
AI Analysis

This work addresses the challenge of detecting deceptive image-text posts on social media, which is an incremental improvement for online safety and public opinion management.

The paper tackled the problem of detecting multimodal rumors on social media by addressing limitations in existing methods, such as limited feature extraction and ignoring external evidence, and achieved improved performance with a model that outperformed baselines in accuracy, recall, and F1 score on Weibo and Twitter datasets.

Social media increasingly disseminates information through mixed image text posts, but rumors often exploit subtle inconsistencies and forged content, making detection based solely on post content difficult. Deep semantic mismatch rumors, which superficially align images and texts, pose particular challenges and threaten online public opinion. Existing multimodal rumor detection methods improve cross modal modeling but suffer from limited feature extraction, noisy alignment, and inflexible fusion strategies, while ignoring external factual evidence necessary for verifying complex rumors. To address these limitations, we propose a multimodal rumor detection model enhanced with external evidence and forgery features. The model uses a ResNet34 visual encoder, a BERT text encoder, and a forgery feature module extracting frequency-domain traces and compression artifacts via Fourier transformation. BLIP-generated image descriptions bridge image and text semantic spaces. A dual contrastive learning module computes contrastive losses between text image and text description pairs, improving detection of semantic inconsistencies. A gated adaptive feature-scaling fusion mechanism dynamically adjusts multimodal fusion and reduces redundancy. Experiments on Weibo and Twitter datasets demonstrate that our model outperforms mainstream baselines in macro accuracy, recall, and F1 score.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes