CLMar 5, 2025

EnigmaToM: Improve LLMs' Theory-of-Mind Reasoning Capabilities with Neural Knowledge Base of Entity States

Hainiu Xu, Siya Qi, Jiazheng Li, Yuxiang Zhou, Jinhua Du, Caroline Catmur, Yulan He

arXiv:2503.03340v214.78 citationsh-index: 47Has CodeACL

Originality Incremental advance

AI Analysis

This addresses the problem of inefficient and limited high-order Theory-of-Mind reasoning in LLMs for applications requiring human-like interaction, representing a novel method for a known bottleneck.

The paper tackles the challenge of improving Large Language Models' Theory-of-Mind reasoning by introducing EnigmaToM, a neuro-symbolic framework that integrates a Neural Knowledge Base for entity states, resulting in significant improvements on benchmarks like ToMi, HiToM, and FANToM, especially in high-order reasoning scenarios.

Theory-of-Mind (ToM), the ability to infer others' perceptions and mental states, is fundamental to human interaction but remains challenging for Large Language Models (LLMs). While existing ToM reasoning methods show promise with reasoning via perceptual perspective-taking, they often rely excessively on off-the-shelf LLMs, reducing their efficiency and limiting their applicability to high-order ToM reasoning. To address these issues, we present EnigmaToM, a novel neuro-symbolic framework that enhances ToM reasoning by integrating a Neural Knowledge Base of entity states (Enigma) for (1) a psychology-inspired iterative masking mechanism that facilitates accurate perspective-taking and (2) knowledge injection that elicits key entity information. Enigma generates structured knowledge of entity states to build spatial scene graphs for belief tracking across various ToM orders and enrich events with fine-grained entity state details. Experimental results on ToMi, HiToM, and FANToM benchmarks show that EnigmaToM significantly improves ToM reasoning across LLMs of varying sizes, particularly excelling in high-order reasoning scenarios.

View on arXiv PDF Code

Similar