CLApr 6, 2023

Towards Interpretable Mental Health Analysis with Large Language Models

arXiv:2304.03347v4158 citationsh-index: 25
Originality Incremental advance
AI Analysis

This work addresses the need for interpretable mental health analysis tools, though it is incremental in applying existing LLMs to this domain with enhanced prompting and evaluation.

The authors evaluated large language models (LLMs) like ChatGPT on mental health analysis across 11 datasets and 5 tasks, finding that while they show strong in-context learning, they lag behind task-specific methods, but careful prompt engineering and emotional cues improve performance, and ChatGPT generates explanations approaching human quality.

The latest large language models (LLMs) such as ChatGPT, exhibit strong capabilities in automated mental health analysis. However, existing relevant studies bear several limitations, including inadequate evaluations, lack of prompting strategies, and ignorance of exploring LLMs for explainability. To bridge these gaps, we comprehensively evaluate the mental health analysis and emotional reasoning ability of LLMs on 11 datasets across 5 tasks. We explore the effects of different prompting strategies with unsupervised and distantly supervised emotional information. Based on these prompts, we explore LLMs for interpretable mental health analysis by instructing them to generate explanations for each of their decisions. We convey strict human evaluations to assess the quality of the generated explanations, leading to a novel dataset with 163 human-assessed explanations. We benchmark existing automatic evaluation metrics on this dataset to guide future related works. According to the results, ChatGPT shows strong in-context learning ability but still has a significant gap with advanced task-specific methods. Careful prompt engineering with emotional cues and expert-written few-shot examples can also effectively improve performance on mental health analysis. In addition, ChatGPT generates explanations that approach human performance, showing its great potential in explainable mental health analysis.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes