CLLGMay 3, 2023

Transfer and Active Learning for Dissonance Detection: Addressing the Rare-Class Challenge

arXiv:2305.02459v2222 citations
Originality Incremental advance
AI Analysis

This work addresses data acquisition for rare-class tasks like detecting cognitive dissonance in social media, offering incremental improvements in active learning strategies.

The paper tackled the rare-class challenge in dissonance detection by evaluating transfer- and active learning strategies, finding that a proposed probability-of-rare-class (PRC) approach effectively improves model accuracy, while transfer learning aids cold-start performance but not active learning iterations.

While transformer-based systems have enabled greater accuracies with fewer training examples, data acquisition obstacles still persist for rare-class tasks -- when the class label is very infrequent (e.g. < 5% of samples). Active learning has in general been proposed to alleviate such challenges, but choice of selection strategy, the criteria by which rare-class examples are chosen, has not been systematically evaluated. Further, transformers enable iterative transfer-learning approaches. We propose and investigate transfer- and active learning solutions to the rare class problem of dissonance detection through utilizing models trained on closely related tasks and the evaluation of acquisition strategies, including a proposed probability-of-rare-class (PRC) approach. We perform these experiments for a specific rare class problem: collecting language samples of cognitive dissonance from social media. We find that PRC is a simple and effective strategy to guide annotations and ultimately improve model accuracy while transfer-learning in a specific order can improve the cold-start performance of the learner but does not benefit iterations of active learning.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes