CLMar 18, 2020

X-Stance: A Multilingual Multi-Target Dataset for Stance Detection

arXiv:2003.08385v24.6107 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This work addresses stance detection for multilingual and multi-target applications, but it is incremental as it builds on existing methods with a new dataset.

The authors tackled the problem of stance detection across multiple languages and political targets by creating a large-scale dataset from Swiss election comments, achieving moderately successful zero-shot cross-lingual and cross-target transfer with multilingual BERT.

We extract a large-scale stance detection dataset from comments written by candidates of elections in Switzerland. The dataset consists of German, French and Italian text, allowing for a cross-lingual evaluation of stance detection. It contains 67 000 comments on more than 150 political issues (targets). Unlike stance detection models that have specific target issues, we use the dataset to train a single model on all the issues. To make learning across targets possible, we prepend to each instance a natural question that represents the target (e.g. "Do you support X?"). Baseline results from multilingual BERT show that zero-shot cross-lingual and cross-target transfer of stance detection is moderately successful with this approach.

View on arXiv PDF Code

Similar