CRAIApr 2

Synthetic Trust Attacks: Modeling How Generative AI Manipulates Human Decisions in Social Engineering Fraud

arXiv:2604.0495110.7h-index: 4
Predicted impact top 99% in CR · last 90 daysOriginality Incremental advance
AI Analysis

For cybersecurity practitioners and researchers, it reframes the attack surface from synthetic media detection to human decision-making, addressing a critical gap in current defenses against AI-powered fraud.

The paper formalizes Synthetic Trust Attacks (STAs) as a new threat category where generative AI manipulates human decisions in social engineering fraud, showing that human deepfake detection accuracy is only ~55.5% and LLM scam agents achieve 46% compliance vs. 18% for humans. It proposes an eight-stage operational framework (STAM) and a decision-layer defense protocol.

Imagine receiving a video call from your CFO, surrounded by colleagues, asking you to urgently authorise a confidential transfer. You comply. Every person on that call was fake, and you just lost $25 million. This is not a hypothetical. It happened in Hong Kong in January 2024, and it is becoming the template for a new generation of fraud. AI has not invented a new crime. It has industrialised an ancient one: the manufacture of trust. This paper proposes Synthetic Trust Attacks (STAs) as a formal threat category and introduces STAM, the Synthetic Trust Attack Model, an eight-stage operational framework covering the full attack chain from adversary reconnaissance through post-compliance leverage. The core argument is this: existing defenses target synthetic media detection, but the real attack surface is the victim's decision. When human deepfake detection accuracy sits at approximately 55.5%, barely above chance, and LLM scam agents achieve 46% compliance versus 18% for human operators while evading safety filters entirely, the perception layer has already failed. Defense must move to the decision layer. We present a five-category Trust-Cue Taxonomy, a reproducible 17-field Incident Coding Schema with a pilot-coded example, and four falsifiable hypotheses linking attack structure to compliance outcomes. The paper further operationalizes the author's practitioner-developed Calm, Check, Confirm protocol as a research-grade decision-layer defense. Synthetic credibility, not synthetic media, is the true attack surface of the AI fraud era.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes