CLJun 11, 2020

Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context

arXiv:2006.06259v131.11000 citations

Originality Incremental advance

AI Analysis

This work addresses sarcasm detection in social media, an incremental improvement for natural language processing applications.

The authors tackled sarcasm detection by developing a data augmentation technique using conversational context and a model that handles varying context lengths, achieving the best performance in the FigLang2020 competition on Reddit and Twitter datasets.

We present a novel data augmentation technique, CRA (Contextual Response Augmentation), which utilizes conversational context to generate meaningful samples for training. We also mitigate the issues regarding unbalanced context lengths by changing the input-output format of the model such that it can deal with varying context lengths effectively. Specifically, our proposed model, trained with the proposed data augmentation technique, participated in the sarcasm detection task of FigLang2020, have won and achieves the best performance in both Reddit and Twitter datasets.

View on arXiv PDF

Similar