CVAIMar 16, 2023

Emotional Reaction Intensity Estimation Based on Multimodal Data

arXiv:2303.09167v13 citationsh-index: 31
Originality Synthesis-oriented
AI Analysis

This addresses emotion analysis for affective computing applications, but appears incremental as it builds on existing competition frameworks.

The paper tackles emotional reaction intensity estimation from multimodal data, achieving significant improvement in Pearson's Correlation Coefficient compared to baseline methods.

This paper introduces our method for the Emotional Reaction Intensity (ERI) Estimation Challenge, in CVPR 2023: 5th Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW). Based on the multimodal data provided by the originazers, we extract acoustic and visual features with different pretrained models. The multimodal features are mixed together by Transformer Encoders with cross-modal attention mechnism. In this paper, 1. better features are extracted with the SOTA pretrained models. 2. Compared with the baseline, we improve the Pearson's Correlations Coefficient a lot. 3. We process the data with some special skills to enhance performance ability of our model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes