CLJul 8, 2024

PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation

arXiv:2407.05721v354 citationsh-index: 23
Originality Synthesis-oriented
AI Analysis

This work addresses mental health applications by enhancing LLMs for psychological tasks, though it is incremental as it adapts existing methods to a specific domain.

The authors tackled the problem of limited psychological understanding in LLMs by proposing PsycoLLM, a specialized model trained on a high-quality psychological dataset, which achieved superior performance on a comprehensive benchmark based on Chinese counseling exams.

Mental health has attracted substantial attention in recent years and LLM can be an effective technology for alleviating this problem owing to its capability in text understanding and dialogue. However, existing research in this domain often suffers from limitations, such as training on datasets lacking crucial prior knowledge and evidence, and the absence of comprehensive evaluation methods. In this paper, we propose a specialized psychological large language model (LLM), named PsycoLLM, trained on a proposed high-quality psychological dataset, including single-turn QA, multi-turn dialogues and knowledge-based QA. Specifically, we construct multi-turn dialogues through a three-step pipeline comprising multi-turn QA generation, evidence judgment, and dialogue refinement. We augment this process with real-world psychological case backgrounds extracted from online platforms, enhancing the relevance and applicability of the generated data. Additionally, to compare the performance of PsycoLLM with other LLMs, we develop a comprehensive psychological benchmark based on authoritative psychological counseling examinations in China, which includes assessments of professional ethics, theoretical proficiency, and case analysis. The experimental results on the benchmark illustrate the effectiveness of PsycoLLM, which demonstrates superior performance compared to other LLMs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes