CLNov 20, 2019

Joint Emotion Label Space Modelling for Affect Lexica

arXiv:1911.08782v310 citations
Originality Incremental advance
AI Analysis

This addresses data poverty in emotion detection for NLP researchers by providing a unified resource, though it is incremental as it builds on existing lexica without surpassing simple concatenation.

The authors tackled the problem of heterogeneous emotion lexica by merging eight existing lexica into a unified 30,273-entry lexicon using a multi-view VAE, which outperformed individual lexica but not a naive concatenation, though it improved performance when added as an extra feature.

Emotion lexica are commonly used resources to combat data poverty in automatic emotion detection. However, vocabulary coverage issues, differences in construction method and discrepancies in emotion framework and representation result in a heterogeneous landscape of emotion detection resources, calling for a unified approach to utilising them. To combat this, we present an extended emotion lexicon of 30,273 unique entries, which is a result of merging eight existing emotion lexica by means of a multi-view variational autoencoder (VAE). We showed that a VAE is a valid approach for combining lexica with different label spaces into a joint emotion label space with a chosen number of dimensions, and that these dimensions are still interpretable. We tested the utility of the unified VAE lexicon by employing the lexicon values as features in an emotion detection model. We found that the VAE lexicon outperformed individual lexica, but contrary to our expectations, it did not outperform a naive concatenation of lexica, although it did contribute to the naive concatenation when added as an extra lexicon. Furthermore, using lexicon information as additional features on top of state-of-the-art language models usually resulted in a better performance than when no lexicon information was used.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes