CLAIOct 30, 2023

Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models

arXiv:2310.19619v2152 citationsh-index: 16Has Code
Originality Synthesis-oriented
AI Analysis

This work tackles the problem of evaluating machine ToM for AI researchers, but it is incremental as it builds on existing psychological studies and benchmarks.

The paper addresses the lack of robust Theory of Mind (ToM) in Large Language Models by proposing a holistic taxonomy of 7 mental state categories and advocating for situated evaluation to mitigate shortcuts and data leakage in benchmarks, with a pilot study in a grid world as proof of concept.

Large Language Models (LLMs) have generated considerable interest and debate regarding their potential emergence of Theory of Mind (ToM). Several recent inquiries reveal a lack of robust ToM in these models and pose a pressing demand to develop new benchmarks, as current ones primarily focus on different aspects of ToM and are prone to shortcuts and data leakage. In this position paper, we seek to answer two road-blocking questions: (1) How can we taxonomize a holistic landscape of machine ToM? (2) What is a more effective evaluation protocol for machine ToM? Following psychological studies, we taxonomize machine ToM into 7 mental state categories and delineate existing benchmarks to identify under-explored aspects of ToM. We argue for a holistic and situated evaluation of ToM to break ToM into individual components and treat LLMs as an agent who is physically situated in environments and socially situated in interactions with humans. Such situated evaluation provides a more comprehensive assessment of mental states and potentially mitigates the risk of shortcuts and data leakage. We further present a pilot study in a grid world setup as a proof of concept. We hope this position paper can facilitate future research to integrate ToM with LLMs and offer an intuitive means for researchers to better position their work in the landscape of ToM. Project page: https://github.com/Mars-tin/awesome-theory-of-mind

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes