SDApr 11

Learning to Attend to Depression-Related Patterns: An Adaptive Cross-Modal Gating Network for Depression Detection

arXiv:2604.1018136.3h-index: 9
Predicted impact top 70% in SD · last 90 daysOriginality Incremental advance
AI Analysis

For researchers in automatic depression detection, this work addresses the overlooked sparsity of depression-related patterns in speech, but the improvement is incremental over existing methods.

The paper proposes an Adaptive Cross-Modal Gating (ACMG) network for depression detection that selectively attends to sparse, depression-related segments in speech, outperforming baselines without it.

Automatic depression detection using speech signals with acoustic and textual modalities is a promising approach for early diagnosis. Depression-related patterns exhibit sparsity in speech: diagnostically relevant features occur in specific segments rather than being uniformly distributed. However, most existing methods treat all frames equally, assuming depression-related information is uniformly distributed and thus overlooking this sparsity. To address this issue, we proposes a depression detection network based on Adaptive Cross-Modal Gating (ACMG) that adaptively reassigns frame-level weights across both modalities, enabling selective attention to depression-related segments. Experimental results show that the depression detection system with ACMG outperforms baselines without it. Visualization analyses further confirm that ACMG automatically attends to clinically meaningful patterns, including low-energy acoustic segments and textual segments containing negative sentiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes