CLFeb 20, 2025

Enhancing Conversational Agents with Theory of Mind: Aligning Beliefs, Desires, and Intentions for Human-Like Interaction

arXiv:2502.14171v53 citationsh-index: 9Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of human-like interaction in conversational AI, though it appears incremental as it builds on existing LLM capabilities.

The study tackled the problem of limited Theory of Mind (ToM) in LLM-based conversational agents by explicitly manipulating beliefs, desires, and intentions, resulting in improved response quality with win rates of 67% and 63% for 3B and 8B LLaMA models respectively.

Natural language interaction with agentic Artificial Intelligence (AI), driven by Large Language Models (LLMs), is expected to remain a dominant paradigm in the near future. While humans instinctively align their communication with mental states -- an ability known as Theory of Mind (ToM), current LLM powered systems exhibit significant limitations in this regard. This study examines the extent to which open source language models (LLaMA) can capture and preserve ToM related information and how effectively it contributes to consistent ToM reasoning in generated responses. We further investigate whether explicit manipulation of ToM related components, such as beliefs, desires, and intentions, can enhance response alignment. Experiments on two LLaMA 3 variants demonstrate that incorporating ToM informed alignment improves response quality, achieving win rates of 67 and 63 percent for the 3B and 8B models, respectively. These findings highlight the potential of ToM driven strategies to improve alignment in LLM based conversational agents.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes