ASLGSDOct 1, 2022

Fine-tuning Wav2vec for Vocal-burst Emotion Recognition

arXiv:2210.00263v13 citationsh-index: 32
Originality Synthesis-oriented
AI Analysis

This work addresses emotion recognition for affective computing, but it is incremental as it applies an existing method (fine-tuning Wav2vec) to a new dataset.

The authors tackled emotion recognition from non-verbal vocal bursts like laughs and cries, achieving promising results compared to the baseline in the A-VB competition.

The ACII Affective Vocal Bursts (A-VB) competition introduces a new topic in affective computing, which is understanding emotional expression using the non-verbal sound of humans. We are familiar with emotion recognition via verbal vocal or facial expression. However, the vocal bursts such as laughs, cries, and signs, are not exploited even though they are very informative for behavior analysis. The A-VB competition comprises four tasks that explore non-verbal information in different spaces. This technical report describes the method and the result of SclabCNU Team for the tasks of the challenge. We achieved promising results compared to the baseline model provided by the organizers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes