CL AIMar 24

Avoiding Over-smoothing in Social Media Rumor Detection with Pre-trained Propagation Tree Transformer

arXiv:2603.2285477.2h-index: 4

AI Analysis

This addresses a key bottleneck in social media rumor detection for researchers and practitioners, offering a novel approach to improve accuracy.

The paper tackles the problem of over-smoothing in rumor detection on social media by proposing a Pre-trained Propagation Tree Transformer (P2T3) method, which outperforms state-of-the-art methods on multiple benchmark datasets and performs well in few-shot conditions.

Deep learning techniques for rumor detection typically utilize Graph Neural Networks (GNNs) to analyze post relations. These methods, however, falter due to over-smoothing issues when processing rumor propagation structures, leading to declining performance. Our investigation into this issue reveals that over-smoothing is intrinsically tied to the structural characteristics of rumor propagation trees, in which the majority of nodes are 1-level nodes. Furthermore, GNNs struggle to capture long-range dependencies within these trees. To circumvent these challenges, we propose a Pre-Trained Propagation Tree Transformer (P2T3) method based on pure Transformer architecture. It extracts all conversation chains from a tree structure following the propagation direction of replies, utilizes token-wise embedding to infuse connection information and introduces necessary inductive bias, and pre-trains on large-scale unlabeled datasets. Experiments indicate that P2T3 surpasses previous state-of-the-art methods in multiple benchmark datasets and performs well under few-shot conditions. P2T3 not only avoids the over-smoothing issue inherent in GNNs but also potentially offers a large model or unified multi-modal scheme for future social media research.

View on arXiv PDF

Similar