CLCRJan 30

Autonomous Chain-of-Thought Distillation for Graph-Based Fraud Detection

arXiv:2601.22949v1h-index: 8
Originality Incremental advance
AI Analysis

This addresses fraud detection in financial or security domains with text-attributed graphs, offering significant performance and efficiency gains, though it builds incrementally on existing LLM-GNN approaches.

The paper tackles the problem of graph-based fraud detection on text-attributed graphs by proposing FraudCoT, a framework that integrates autonomous chain-of-thought reasoning with LLM-GNN co-training. The result is up to 8.8% AUPRC improvement over state-of-the-art methods and up to 1,066x training speedup.

Graph-based fraud detection on text-attributed graphs (TAGs) requires jointly modeling rich textual semantics and relational dependencies. However, existing LLM-enhanced GNN approaches are constrained by predefined prompting and decoupled training pipelines, limiting reasoning autonomy and weakening semantic-structural alignment. We propose FraudCoT, a unified framework that advances TAG-based fraud detection through autonomous, graph-aware chain-of-thought (CoT) reasoning and scalable LLM-GNN co-training. To address the limitations of predefined prompts, we introduce a fraud-aware selective CoT distillation mechanism that generates diverse reasoning paths and enhances semantic-structural understanding. These distilled CoTs are integrated into node texts, providing GNNs with enriched, multi-hop semantic and structural cues for fraud detection. Furthermore, we develop an efficient asymmetric co-training strategy that enables end-to-end optimization while significantly reducing the computational cost of naive joint training. Extensive experiments on public and industrial benchmarks demonstrate that FraudCoT achieves up to 8.8% AUPRC improvement over state-of-the-art methods and delivers up to 1,066x speedup in training throughput, substantially advancing both detection performance and efficiency.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes