CRMay 12

TM-RUGPULL: A Temporary Sound, Multimodal Dataset for Early Detection of RUG Pulls Across the Tokenized Ecosystem

arXiv:2602.215299.01 citationsh-index: 9
AI Analysis

Provides a scientific-grade, leakage-resistant dataset to address the lack of reliable resources for early rug-pull detection research in blockchain ecosystems.

The paper introduces TM-RugPull, a multimodal dataset of 1,028 token projects with strict temporal hygiene and expert-verified labels, enabling causally valid early detection of rug-pull attacks across DeFi, meme coins, NFTs, and celebrity tokens.

Rug-pull attacks pose a systemic threat across the blockchain ecosystem, yet research into early detection is hindered by the lack of scientific-grade datasets. Existing resources often suffer from temporal data leakage, narrow modality, and ambiguous labeling, particularly outside DeFi contexts. To address these limitations, we present TM-RugPull, a rigorously curated, leakage-resistant dataset of 1,028 token projects spanning DeFi, meme coins, NFTs, and celebrity-themed tokens. RugPull enforces strict temporal hygiene by extracting all features on chain behavior, smart contract metadata, and OSINT signals strictly from the first half of each project's lifespan. Labels are grounded in forensic reports and longevity criteria, verified through multi-expert consensus. This dataset enables causally valid, multimodal analysis of rug-pull dynamics and establishes a new benchmark for reproducible fraud detection research.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes