CR Aug 25, 2025
Tricking LLM-Based NPCs into Spilling Secrets
Kyohei Shiomi, Zhuotao Lian, Toru Nakanishi et al.
Large Language Models (LLMs) are increasingly used to generate dynamic dialogue for game NPCs. However, their integration raises new security concerns. In this study, we examine whether adversarial prompt injection can cause LLM-based NPCs to reveal hidden background secrets that are meant to remain undisclosed.