AIMay 11

A Reflective Storytelling Agent for Older Adults: Integrating Argumentation Schemes and Argument Mining in LLM-Based Personalised Narratives

Jayalakshmi Baskar, Vera C. Kaelin, Kaan Kilic, Helena Lindgren

arXiv:2605.1053110.7

Predicted impact top 73% in AI · last 90 daysOriginality Incremental advance

AI Analysis

This work addresses the need for transparent and purposeful narrative interaction in digital companions for older adults, offering a method to inspect and improve LLM-generated stories.

The paper presents a reflective storytelling agent for older adults that integrates knowledge graphs, user modeling, argumentation theory, and argument mining to guide LLM-based narrative generation. In a study with 55 older adults, participants recognized personally relevant purposes in roughly two-thirds of narratives, and higher argument-quality indicators correlated with higher clarity and meaningfulness ratings.

This work investigates whether knowledge-driven large language model (LLM)-based storytelling can support purposeful narrative interaction with a digital companion for older adults. To address known limitations of LLMs, including hallucinations and limited transparency, we present a reflective storytelling agent integrating knowledge graphs, user modelling, argumentation theory, and argument mining to guide and inspect narrative generation. The study consisted of two phases. Phase I employed participatory design involving 11 domain experts in a formative evaluation that informed iterative refinement. The resulting system generates narratives grounded in structured user models representing health-promoting activities and motivations. Phase II involved 55 older adults evaluating persona-based narratives across four prompts and two creativity levels. Participants assessed perceived purpose, usefulness, cultural relatability, and inconsistencies. The system additionally computed hallucination-risk indicators to evaluate generated narratives. Participants recognised personally relevant purposes in roughly two thirds of narratives, while argument-based purposes were identified in around half of these cases. Cultural recognisability strongly influenced willingness to use the functionality, whereas minor inconsistencies were often tolerated when narratives remained understandable and personally relevant. Narratives with higher hallucination-risk indicators were more often perceived as inconsistent, while higher argument-quality indicators tended to co-occur with higher clarity and meaningfulness ratings. Overall, the study positions argument mining as a reflective inspection mechanism for comparing formal grounding signals with human evaluations in health-oriented LLM storytelling for older adults.

View on arXiv PDF

Similar