CRMAApr 15

SoK: Security of Autonomous LLM Agents in Agentic Commerce

arXiv:2604.1536792.51 citationsh-index: 2
Predicted impact top 4% in CR · last 90 daysOriginality Incremental advance
AI Analysis

Provides a systematic security framework for the emerging field of agentic commerce, addressing a gap in existing security frameworks for autonomous LLM agents in financial transactions.

This SoK develops a unified security framework for autonomous LLM agents in commerce, identifying 12 cross-layer attack vectors and proposing a layered defense architecture. It shows that securing agentic commerce requires coordinated controls across LLM safety, protocol design, identity, market structure, and regulation.

Autonomous large language model (LLM) agents such as OpenClaw are pushing agentic commerce from human-supervised assistance toward machine actors that can negotiate, purchase services, manage digital assets, and execute transactions across on-chain and off-chain environments. Protocols such as the Trustless Agents standard (ERC-8004), Agent Payments Protocol (AP2), the HTTP 402-based payment protocol (x402), Agent Commerce Protocol (ACP), the Agentic Commerce standard (ERC-8183), and Machine Payments Protocol (MPP) enable this transition, but they also create an attack surface that existing security frameworks do not capture well. This Systematization of Knowledge (SoK) develops a unified security framework for autonomous LLM agents in commerce and finance. We organize threats along five dimensions: agent integrity, transaction authorization, inter-agent trust, market manipulation, and regulatory compliance. From a systematically curated public corpus of academic papers, protocol documents, industry reports, and incident evidence, we derive 12 cross-layer attack vectors and show how failures propagate from reasoning and tooling layers into custody, settlement, market harm, and compliance exposure. We then propose a layered defense architecture addressing authorization gaps left by current agent-payment protocols. Overall, our analysis shows that securing agentic commerce is inherently a cross-layer problem that requires coordinated controls across LLM safety, protocol design, identity, market structure, and regulation. We conclude with a research roadmap and a benchmark agenda for secure autonomous commerce.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes