CLSep 14, 2022

Natural Language Inference Prompts for Zero-shot Emotion Classification in Text across Corpora

arXiv:2209.06701v2584 citationsh-index: 32
Originality Incremental advance
AI Analysis

This work addresses the challenge of flexible label sets in emotion classification for domains like social media and blogs, but it is incremental as it builds on existing zero-shot learning and natural language inference methods.

The paper tackles the problem of zero-shot emotion classification across different text corpora by analyzing how sensitive natural language inference models are to prompt formulations, showing that combining multiple prompts yields robust performance nearly matching the best individual prompt for each corpus.

Within textual emotion classification, the set of relevant labels depends on the domain and application scenario and might not be known at the time of model development. This conflicts with the classical paradigm of supervised learning in which the labels need to be predefined. A solution to obtain a model with a flexible set of labels is to use the paradigm of zero-shot learning as a natural language inference task, which in addition adds the advantage of not needing any labeled training data. This raises the question how to prompt a natural language inference model for zero-shot learning emotion classification. Options for prompt formulations include the emotion name anger alone or the statement "This text expresses anger". With this paper, we analyze how sensitive a natural language inference-based zero-shot-learning classifier is to such changes to the prompt under consideration of the corpus: How carefully does the prompt need to be selected? We perform experiments on an established set of emotion datasets presenting different language registers according to different sources (tweets, events, blogs) with three natural language inference models and show that indeed the choice of a particular prompt formulation needs to fit to the corpus. We show that this challenge can be tackled with combinations of multiple prompts. Such ensemble is more robust across corpora than individual prompts and shows nearly the same performance as the individual best prompt for a particular corpus.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes