CLAIMar 19, 2025

Exploring Large Language Models for Word Games:Who is the Spy?

arXiv:2503.15235v14 citationsh-index: 1Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of applying LLMs to structured word games for NLP and game theory research, though it is incremental as it adapts existing methods to a specific game context.

The study tackled the problem of enabling large language models (LLMs) to play the word game 'Who is the Spy' by proposing a training-free, Chain-of-Thought-based scheduling framework, resulting in notable performance improvements across multiple datasets as measured by game success rates and analytical accuracy.

Word games hold significant research value for natural language processing (NLP), game theory, and related fields due to their rule-based and situational nature. This study explores how large language models (LLMs) can be effectively involved in word games and proposes a training-free framework. "Shei Shi Wo Di" or "Who is the Spy" in English, is a classic word game. Using this game as an example, we introduce a Chain-of-Thought (CoT)-based scheduling framework to enable LLMs to achieve excellent performance in tasks such as inferring role words and disguising their identities. We evaluate the framework's performance based on game success rates and the accuracy of the LLM agents' analytical results. Experimental results affirm the framework's effectiveness, demonstrating notable improvements in LLM performance across multiple datasets. This work highlights the potential of LLMs in mastering situational reasoning and social interactions within structured game environments. Our code is publicly available at https://github.com/ct-wei/Who-is-The-Spy.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes