AIHCLGApr 26

Modeling Induced Pleasure through Cognitive Appraisal Prediction via Multimodal Fusion

arXiv:2604.2375311.2
Predicted impact top 96% in AI · last 90 daysOriginality Incremental advance
AI Analysis

For researchers in affective computing, this work provides an interpretable method to infer pleasure from videos, but the accuracy is modest and the approach is incremental.

This paper introduces a computational model to predict video-induced pleasure using cognitive appraisal variables, addressing issues like noisy labels and semantic gaps. The model achieves a peak accuracy of 0.6624 in predicting pleasure levels.

Multimodal affective computing analyzes user-generated social media content to predict emotional states. However, a critical gap remains in understanding how visual content shapes cognitive interpretations and elicits specific affective experiences such as pleasure. This study introduces a novel computational model to infer video-induced pleasure via cognitive appraisal variables. The proposed model addresses four challenges: (1) noisy and inconsistent human labels, (2) the semantic gap between "positive emotions" and "pleasure," (3) the scarcity of pleasure-specific datasets, and (4) the limited interpretability of existing black-box fusion methods. Our approach integrates data-driven and cognitive theory-driven methods, using cognitive appraisal theory and a fuzzy model within an innovative framework. The model employs transformer-based architectures and attention mechanisms for fine-grained multimodal feature extraction and interpretable fusion to capture both inter- and intra-modal dynamics associated with pleasure. This enables the prediction of underlying appraisal variables, thereby bridging the semantic gap and enhancing model explainability beyond conventional statistical associations. Experimental results validate the efficacy of the proposed method in detecting video-induced pleasure, achieving a peak accuracy of 0.6624 in predicting pleasure levels. These findings highlight promising implications for affective content recommendation, intelligent media creation, and advancing our understanding of how digital media influences human emotions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes