CL LGSep 19, 2021

UPV at CheckThat! 2021: Mitigating Cultural Differences for Identifying Multilingual Check-worthy Claims

Ipek Baris Schlicht, Angel Felipe Magnossão de Paula, Paolo Rosso

arXiv:2109.09232v11.412 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the understudied issue of cultural differences in automated fact-checking for multilingual social media, but it is incremental as it builds on existing datasets and methods.

The paper tackled the problem of multilingual check-worthy claim detection by proposing joint training with a language identification task to mitigate cultural bias, resulting in performance gains for some languages in the CLEF-2021 dataset.

Identifying check-worthy claims is often the first step of automated fact-checking systems. Tackling this task in a multilingual setting has been understudied. Encoding inputs with multilingual text representations could be one approach to solve the multilingual check-worthiness detection. However, this approach could suffer if cultural bias exists within the communities on determining what is check-worthy.In this paper, we propose a language identification task as an auxiliary task to mitigate unintended bias.With this purpose, we experiment joint training by using the datasets from CLEF-2021 CheckThat!, that contain tweets in English, Arabic, Bulgarian, Spanish and Turkish. Our results show that joint training of language identification and check-worthy claim detection tasks can provide performance gains for some of the selected languages.

View on arXiv PDF Code

Similar