CLJul 3, 2024

Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory

arXiv:2407.03103v259 citationsh-index: 13Has Code
AI Analysis

This work addresses the challenge of improving accessibility to psychological counseling by providing a dataset for training AI counselors, though it is incremental as it builds on existing methods for data generation.

The authors tackled the lack of realistic counseling datasets for training open-source large language models by introducing Cactus, a multi-turn dialogue dataset based on Cognitive Behavioral Therapy, and showed that a model trained with it outperforms others in counseling skills.

Recently, the demand for psychological counseling has significantly increased as more individuals express concerns about their mental health. This surge has accelerated efforts to improve the accessibility of counseling by using large language models (LLMs) as counselors. To ensure client privacy, training open-source LLMs faces a key challenge: the absence of realistic counseling datasets. To address this, we introduce Cactus, a multi-turn dialogue dataset that emulates real-life interactions using the goal-oriented and structured approach of Cognitive Behavioral Therapy (CBT). We create a diverse and realistic dataset by designing clients with varied, specific personas, and having counselors systematically apply CBT techniques in their interactions. To assess the quality of our data, we benchmark against established psychological criteria used to evaluate real counseling sessions, ensuring alignment with expert evaluations. Experimental results demonstrate that Camel, a model trained with Cactus, outperforms other models in counseling skills, highlighting its effectiveness and potential as a counseling agent. We make our data, model, and code publicly available.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes