AIJun 1

Bayesian Spectral Emotion Transition Discovery from Multi-Annotator Disagreement

arXiv:2606.0190622.2
Predicted impact top 92% in AI · last 90 daysOriginality Incremental advance
AI Analysis

For researchers studying emotion dynamics in conversations, this work provides a principled method to leverage annotator disagreement instead of discarding it, bridging computational analysis with psychological theory.

The paper proposes BSETD, a two-stage framework that models emotion transitions from multi-annotator soft labels, recovering psychologically meaningful transition patterns (e.g., disgust→anger over-represented, joy→anger under-represented) with high cross-corpus consistency (Pearson r=0.91-0.98 within English, 0.79-0.85 cross-lingual).

Emotions evolve through the dynamics of conversation, and understanding their transition structure is foundational to applications ranging from mental-health screening to dialogue systems. However, existing studies typically compress multi-rater judgments into a single hard label by majority voting, discarding the uncertainty signal needed to understand turn-to-turn transitions. In this article, we propose Bayesian Spectral Emotion Transition Discovery (BSETD), a two-stage framework that discovers emotion-transition structure from multi-rater soft labels. In the first stage, a hierarchical Dirichlet-Multinomial posterior is constructed through the outer product of soft labels, equipping each cell of the K x K transition matrix with a credible interval and Benjamini-Hochberg (BH) false discovery rate (FDR)-controlled significance. In the second stage, the symmetrized graph Laplacian is spectrally decomposed to separate a low-frequency (inertia) component from a high-frequency (contagion) component. On EmotionLines, BSETD simultaneously recovers the signatures of two distinct affective spaces: the Plutchik-adjacent transitions disgust to anger (log2 lift +0.94) and anger to disgust (+0.86) are over-represented, while the Russell-valence-reversed transitions joy to anger (-0.90) and anger to joy (-0.89) are under-represented. A five-source cross-corpus validation yields pairwise Pearson correlations in 0.91-0.98 within English, 0.79-0.85 against Chinese M3ED, and 0.979 between the human hard labels and the LLM virtual soft labels on the same utterance set, demonstrating that a pipeline preserving annotator uncertainty bridges the computational study of emotion dynamics with established psychological theory.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes