HCMay 10

Exploring a Multimodal Chatbot as a Facilitator in Therapeutic Art Activity

arXiv:2602.1418373.7h-index: 5
AI Analysis

For art therapists and researchers, this work explores AI-mediated therapeutic support, but it is an early-stage, incremental study with a small expert evaluation.

The paper introduces an MLLM-powered chatbot that analyzes visual art in real-time and engages users in reflective conversations to facilitate therapeutic art activities. Evaluation with five experts showed potential for therapeutic engagement but highlighted areas for improvement such as risk management and personalization.

Therapeutic art activities, such as expressive drawing and painting, require the synergy between creative visual production and interactive dialogue. Recent advancements in Multimodal Large Language Models (MLLMs) have expanded the capacity of computing systems to interpret both textual and visual data, offering a new frontier for AI-mediated therapeutic support. This work-in-progress paper introduces an MLLM-powered chatbot that analyzes visual creation in real-time while engaging the creator in reflective conversations. We conducted an evaluation with five experts in art therapy and related fields, which demonstrated the chatbot's potential to facilitate therapeutic engagement, and highlighted several areas for future development, including entryways and risk management, bespoke alignment of user profile and therapeutic style, balancing conversational depth and width, and enriching visual interactivity. These themes provide a design roadmap for designing the future AI-mediated creative expression tools.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes