CLAug 2, 2024

IAI Group at CheckThat! 2024: Transformer Models and Data Augmentation for Checkworthy Claim Detection

Peter Røysland Aarnes, Vinay Setty, Petra Galuščáková

arXiv:2408.01118v11.05 citationsh-index: 4Has Code

Originality Synthesis-oriented

AI Analysis

This addresses the problem of identifying claims needing verification for fact-checkers, though it appears incremental with variable performance across languages.

The paper tackled automated check-worthy claim detection in English, Dutch, and Arabic political debates and Twitter data, achieving ninth-best in English, third-best in Dutch, and top placement in Arabic on the CheckThat! 2024 leaderboard.

This paper describes IAI group's participation for automated check-worthiness estimation for claims, within the framework of the 2024 CheckThat! Lab "Task 1: Check-Worthiness Estimation". The task involves the automated detection of check-worthy claims in English, Dutch, and Arabic political debates and Twitter data. We utilized various pre-trained generative decoder and encoder transformer models, employing methods such as few-shot chain-of-thought reasoning, fine-tuning, data augmentation, and transfer learning from one language to another. Despite variable success in terms of performance, our models achieved notable placements on the organizer's leaderboard: ninth-best in English, third-best in Dutch, and the top placement in Arabic, utilizing multilingual datasets for enhancing the generalizability of check-worthiness detection. Despite a significant drop in performance on the unlabeled test dataset compared to the development test dataset, our findings contribute to the ongoing efforts in claim detection research, highlighting the challenges and potential of language-specific adaptations in claim verification systems.

View on arXiv PDF Code

Similar