CL AIFeb 9, 2025

Enhancing Depression Detection with Chain-of-Thought Prompting: From Emotion to Reasoning Using Large Language Models

Shiyu Teng, Jiaqing Liu, Rahul Kumar Jain, Shurong Chai, Ruibo Hou, Tomoko Tateyama, Lanfen Lin, Yen-wei Chen

arXiv:2502.05879v113.912 citationsh-index: 12EMBC

Originality Incremental advance

AI Analysis

This addresses the challenge of accurate and interpretable depression detection for mental health applications, though it appears incremental as it builds on existing LLM methods with a structured prompting technique.

The paper tackles the problem of nuanced symptom identification and lack of transparency in LLM-based depression detection by proposing a Chain-of-Thought Prompting approach that breaks down detection into four stages, resulting in superior performance in classification accuracy and diagnostic insights on the E-DAIC dataset.

Depression is one of the leading causes of disability worldwide, posing a severe burden on individuals, healthcare systems, and society at large. Recent advancements in Large Language Models (LLMs) have shown promise in addressing mental health challenges, including the detection of depression through text-based analysis. However, current LLM-based methods often struggle with nuanced symptom identification and lack a transparent, step-by-step reasoning process, making it difficult to accurately classify and explain mental health conditions. To address these challenges, we propose a Chain-of-Thought Prompting approach that enhances both the performance and interpretability of LLM-based depression detection. Our method breaks down the detection process into four stages: (1) sentiment analysis, (2) binary depression classification, (3) identification of underlying causes, and (4) assessment of severity. By guiding the model through these structured reasoning steps, we improve interpretability and reduce the risk of overlooking subtle clinical indicators. We validate our method on the E-DAIC dataset, where we test multiple state-of-the-art large language models. Experimental results indicate that our Chain-of-Thought Prompting technique yields superior performance in both classification accuracy and the granularity of diagnostic insights, compared to baseline approaches.

View on arXiv PDF

Similar