LGAIFeb 28, 2025

LLM-Empowered Class Imbalanced Graph Prompt Learning for Online Drug Trafficking Detection

arXiv:2503.01900v112 citationsh-index: 24ACL
Originality Incremental advance
AI Analysis

This work addresses a critical social issue of illicit drug trafficking detection for online platforms, but it is incremental as it builds on existing graph neural network and prompt learning methods.

The paper tackles the problem of detecting online drug trafficking on platforms like Twitter by addressing class imbalance and label scarcity, proposing LLM-HetGDT, which uses LLMs to augment minority classes and fine-tune prompts, achieving effective detection as shown on a new dataset.

As the market for illicit drugs remains extremely profitable, major online platforms have become direct-to-consumer intermediaries for illicit drug trafficking participants. These online activities raise significant social concerns that require immediate actions. Existing approaches to combating this challenge are generally impractical, due to the imbalance of classes and scarcity of labeled samples in real-world applications. To this end, we propose a novel Large Language Model-empowered Heterogeneous Graph Prompt Learning framework for illicit Drug Trafficking detection, called LLM-HetGDT, that leverages LLM to facilitate heterogeneous graph neural networks (HGNNs) to effectively identify drug trafficking activities in the class-imbalanced scenarios. Specifically, we first pre-train HGNN over a contrastive pretext task to capture the inherent node and structure information over the unlabeled drug trafficking heterogeneous graph (HG). Afterward, we employ LLM to augment the HG by generating high-quality synthetic user nodes in minority classes. Then, we fine-tune the soft prompts on the augmented HG to capture the important information in the minority classes for the downstream drug trafficking detection task. To comprehensively study online illicit drug trafficking activities, we collect a new HG dataset over Twitter, called Twitter-HetDrug. Extensive experiments on this dataset demonstrate the effectiveness, efficiency, and applicability of LLM-HetGDT.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes