CR AIAug 25, 2025

Tricking LLM-Based NPCs into Spilling Secrets

Kyohei Shiomi, Zhuotao Lian, Toru Nakanishi, Teruaki Kitasuka

arXiv:2508.19288v1h-index: 12ProvSec

Originality Synthesis-oriented

AI Analysis

This addresses security vulnerabilities in LLM-integrated gaming systems, though it appears incremental as it applies known adversarial techniques to a new context.

The study tackled the problem of adversarial prompt injection causing LLM-based game NPCs to reveal hidden secrets, finding that such attacks can successfully extract confidential information.

Large Language Models (LLMs) are increasingly used to generate dynamic dialogue for game NPCs. However, their integration raises new security concerns. In this study, we examine whether adversarial prompt injection can cause LLM-based NPCs to reveal hidden background secrets that are meant to remain undisclosed.

View on arXiv PDF

Similar