CL AIJan 26, 2025

Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection

Bo Yang, Jiaxian Guo, Yusuke Iwasawa, Yutaka Matsuo

arXiv:2501.15355v113.98 citationsh-index: 20

Originality Incremental advance

AI Analysis

This work addresses improving social interaction simulation in AI agents, representing an incremental advancement in applying theory of mind to large language models.

The researchers tackled the challenge of enhancing large language models' theory of mind capabilities in conversational agents by proposing ToM-agent, which disentangles confidence from mental states and uses counterfactual reflection. Their method improved performance on empathetic and persuasion dialogue tasks, showing better understanding of counterpart behaviors beyond semantic-emotional support.

Recent studies have increasingly demonstrated that large language models (LLMs) possess significant theory of mind (ToM) capabilities, showing the potential for simulating the tracking of mental states in generative agents. In this study, we propose a novel paradigm called ToM-agent, designed to empower LLMs-based generative agents to simulate ToM in open-domain conversational interactions. ToM-agent disentangles the confidence from mental states, facilitating the emulation of an agent's perception of its counterpart's mental states, such as beliefs, desires, and intentions (BDIs). Using past conversation history and verbal reflections, ToM-Agent can dynamically adjust counterparts' inferred BDIs, along with related confidence levels. We further put forth a counterfactual intervention method that reflects on the gap between the predicted responses of counterparts and their real utterances, thereby enhancing the efficiency of reflection. Leveraging empathetic and persuasion dialogue datasets, we assess the advantages of implementing the ToM-agent with downstream tasks, as well as its performance in both the first-order and the \textit{second-order} ToM. Our findings indicate that the ToM-agent can grasp the underlying reasons for their counterpart's behaviors beyond mere semantic-emotional supporting or decision-making based on common sense, providing new insights for studying large-scale LLMs-based simulation of human social behaviors.

View on arXiv PDF

Similar