CRAINov 20, 2023

Assessing Prompt Injection Risks in 200+ Custom GPTs

arXiv:2311.11538v284 citationsh-index: 16
Originality Synthesis-oriented
AI Analysis

This addresses security and privacy risks for users and developers of customizable AI models, though it is incremental as it builds on known prompt injection issues in a new context.

The study assessed prompt injection vulnerabilities in over 200 user-customized GPT models, finding that adversaries could extract system prompts and access uploaded files, highlighting significant security risks.

In the rapidly evolving landscape of artificial intelligence, ChatGPT has been widely used in various applications. The new feature - customization of ChatGPT models by users to cater to specific needs has opened new frontiers in AI utility. However, this study reveals a significant security vulnerability inherent in these user-customized GPTs: prompt injection attacks. Through comprehensive testing of over 200 user-designed GPT models via adversarial prompts, we demonstrate that these systems are susceptible to prompt injections. Through prompt injection, an adversary can not only extract the customized system prompts but also access the uploaded files. This paper provides a first-hand analysis of the prompt injection, alongside the evaluation of the possible mitigation of such attacks. Our findings underscore the urgent need for robust security frameworks in the design and deployment of customizable GPT models. The intent of this paper is to raise awareness and prompt action in the AI community, ensuring that the benefits of GPT customization do not come at the cost of compromised security and privacy.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes