CLLGSep 19, 2021

UPV at CheckThat! 2021: Mitigating Cultural Differences for Identifying Multilingual Check-worthy Claims

arXiv:2109.09232v112 citations
AI Analysis

This addresses the understudied issue of cultural differences in automated fact-checking for multilingual social media, but it is incremental as it builds on existing datasets and methods.

The paper tackled the problem of multilingual check-worthy claim detection by proposing joint training with a language identification task to mitigate cultural bias, resulting in performance gains for some languages in the CLEF-2021 dataset.

Identifying check-worthy claims is often the first step of automated fact-checking systems. Tackling this task in a multilingual setting has been understudied. Encoding inputs with multilingual text representations could be one approach to solve the multilingual check-worthiness detection. However, this approach could suffer if cultural bias exists within the communities on determining what is check-worthy.In this paper, we propose a language identification task as an auxiliary task to mitigate unintended bias.With this purpose, we experiment joint training by using the datasets from CLEF-2021 CheckThat!, that contain tweets in English, Arabic, Bulgarian, Spanish and Turkish. Our results show that joint training of language identification and check-worthy claim detection tasks can provide performance gains for some of the selected languages.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes