CLFeb 18, 2025

Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements

arXiv:2502.12904v235 citationsh-index: 14ACL
Originality Incremental advance
AI Analysis

This addresses the need for better assessment of LLMs' fraud detection capabilities in dynamic, real-world scenarios, but it is incremental as it builds on existing benchmarking efforts.

The authors tackled the problem of evaluating LLMs' robustness against internet fraud and phishing by introducing Fraud-R1, a multi-round benchmark with 8,564 cases across 5 fraud types, and found significant challenges, especially in role-play settings and fake job postings, with a performance gap between Chinese and English.

We introduce Fraud-R1, a benchmark designed to evaluate LLMs' ability to defend against internet fraud and phishing in dynamic, real-world scenarios. Fraud-R1 comprises 8,564 fraud cases sourced from phishing scams, fake job postings, social media, and news, categorized into 5 major fraud types. Unlike previous benchmarks, Fraud-R1 introduces a multi-round evaluation pipeline to assess LLMs' resistance to fraud at different stages, including credibility building, urgency creation, and emotional manipulation. Furthermore, we evaluate 15 LLMs under two settings: 1. Helpful-Assistant, where the LLM provides general decision-making assistance, and 2. Role-play, where the model assumes a specific persona, widely used in real-world agent-based interactions. Our evaluation reveals the significant challenges in defending against fraud and phishing inducement, especially in role-play settings and fake job postings. Additionally, we observe a substantial performance gap between Chinese and English, underscoring the need for improved multilingual fraud detection capabilities.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes