CLDec 17, 2024

Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health

arXiv:2412.12981v17 citationsh-index: 38NLPAICS
Originality Synthesis-oriented
AI Analysis

This work addresses data and bias issues for researchers and practitioners in mental health, but it is incremental as it builds on existing datasets and methods.

The paper tackles challenges like bias and data scarcity in using large language models (LLMs) for mental health by introducing IC-AnnoMI, an expert-annotated dataset for motivational interviewing, and finds that it improves model performance in classification tasks, though specific numerical gains are not detailed.

Large language models (LLMs) have shown promising capabilities in healthcare analysis but face several challenges like hallucinations, parroting, and bias manifestation. These challenges are exacerbated in complex, sensitive, and low-resource domains. Therefore, in this work we introduce IC-AnnoMI, an expert-annotated motivational interviewing (MI) dataset built upon AnnoMI by generating in-context conversational dialogues leveraging LLMs, particularly ChatGPT. IC-AnnoMI employs targeted prompts accurately engineered through cues and tailored information, taking into account therapy style (empathy, reflection), contextual relevance, and false semantic change. Subsequently, the dialogues are annotated by experts, strictly adhering to the Motivational Interviewing Skills Code (MISC), focusing on both the psychological and linguistic dimensions of MI dialogues. We comprehensively evaluate the IC-AnnoMI dataset and ChatGPT's emotional reasoning ability and understanding of domain intricacies by modeling novel classification tasks employing several classical machine learning and current state-of-the-art transformer approaches. Finally, we discuss the effects of progressive prompting strategies and the impact of augmented data in mitigating the biases manifested in IC-AnnoM. Our contributions provide the MI community with not only a comprehensive dataset but also valuable insights for using LLMs in empathetic text generation for conversational therapy in supervised settings.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes