CLJun 2

DMT-CBT: Longitudinal Therapeutic State Modeling for CBT Counseling

arXiv:2606.0313219.3
Predicted impact top 85% in CL · last 90 daysOriginality Incremental advance
AI Analysis

This addresses the mismatch between current LLM-based CBT approaches (local, single-session) and real psychotherapy (longitudinal, multimodal), offering a more realistic modeling paradigm for AI-assisted therapy.

The paper proposes DMT-CBT, a framework for longitudinal therapeutic state modeling in CBT counseling, and shows it improves counseling fidelity, therapeutic alliance, and longitudinal affective trajectories over baselines.

Large language models (LLMs) have shown growing potential for Cognitive Behavioral Therapy (CBT) counseling. However, most existing approaches still formulate counseling as a local response generation problem, focusing on empathetic replies within short, text-only, or single-session interactions. We argue that this formulation fundamentally mismatches the nature of real psychotherapy. In clinical CBT, therapy is a longitudinal process in which therapists continuously infer, update, and intervene on evolving therapeutic states across sessions. Realistic CBT further involves multimodal inference and delayed cross-session intervention effects, requiring models to capture longitudinal therapeutic state evolution under partial observability. We propose DMT-CBT, a framework for Dynamic Modeling of evolving Therapeutic states in CBT counseling. DMT-CBT maintains structured therapeutic states across sessions while incorporating multimodal behavioral grounding and tool-augmented intervention to support adaptive therapeutic reasoning. Based on this framework, we construct DMTCorpus, a synthetic multi-session multimodal CBT counseling dataset featuring evolving therapeutic states, image-grounded client behaviors, and cross-session intervention continuity. Experimental results show that DMT-CBT improves counseling fidelity and therapeutic alliance, produces more favorable longitudinal affective trajectories, and preserves therapeutic states more faithfully than post-hoc extraction approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes