CLMay 6, 2022

Bridging the Domain Gap for Stance Detection for the Zulu language

arXiv:2205.03153v10.233 citationsh-index: 36
AI Analysis50

This addresses misinformation detection for low-resource languages like Zulu, though it is incremental as it adapts existing English methods.

The paper tackles the problem of stance detection for the Zulu language by bridging the domain gap from English using domain adaptation techniques, achieving results comparable to English without requiring human expertise in Zulu.

Misinformation has become a major concern in recent last years given its spread across our information sources. In the past years, many NLP tasks have been introduced in this area, with some systems reaching good results on English language datasets. Existing AI based approaches for fighting misinformation in literature suggest automatic stance detection as an integral first step to success. Our paper aims at utilizing this progress made for English to transfers that knowledge into other languages, which is a non-trivial task due to the domain gap between English and the target languages. We propose a black-box non-intrusive method that utilizes techniques from Domain Adaptation to reduce the domain gap, without requiring any human expertise in the target language, by leveraging low-quality data in both a supervised and unsupervised manner. This allows us to rapidly achieve similar results for stance detection for the Zulu language, the target language in this work, as are found for English. We also provide a stance detection dataset in the Zulu language. Our experimental results show that by leveraging English datasets and machine translation we can increase performances on both English data along with other languages.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes