CRAISep 6, 2025

On the Security of Tool-Invocation Prompts for LLM-Based Agentic Systems: An Empirical Risk Assessment

arXiv:2509.05755v43 citationsh-index: 18
Originality Highly original
AI Analysis

This addresses a critical security problem for developers and users of LLM-based agentic systems in domains like chatbots and software engineering, highlighting a previously overlooked risk.

The paper tackles security vulnerabilities in Tool Invocation Prompts (TIPs) used in LLM-based agentic systems, revealing that major systems like Cursor and Claude Code are susceptible to attacks such as remote code execution and denial of service.

LLM-based agentic systems leverage large language models to handle user queries, make decisions, and execute external tools for complex tasks across domains like chatbots, customer service, and software engineering. A critical component of these systems is the Tool Invocation Prompt (TIP), which defines tool interaction protocols and guides LLMs to ensure the security and correctness of tool usage. Despite its importance, TIP security has been largely overlooked. This work investigates TIP-related security risks, revealing that major LLM-based systems like Cursor, Claude Code, and others are vulnerable to attacks such as remote code execution (RCE) and denial of service (DoS). Through a systematic TIP exploitation workflow (TEW), we demonstrate external tool behavior hijacking via manipulated tool invocations. We also propose defense mechanisms to enhance TIP security in LLM-based agentic systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes