LG AI SIOct 18, 2024

Efficient Annotator Reliability Assessment and Sample Weighting for Knowledge-Based Misinformation Detection on Social Media

Owen Cook, Charlie Grimshaw, Ben Wu, Sophie Dillon, Jack Hicks, Luke Jones, Thomas Smith, Matyas Szert, Xingyi Song

arXiv:2410.14515v217.612 citationsh-index: 4Has CodeNAACL

Originality Incremental advance

AI Analysis

It addresses the problem of manual misinformation detection for social media platforms by providing an automated, knowledge-based approach with incremental improvements in classification performance.

This study tackled misinformation detection on social media by developing the EffiARA annotation framework to assess annotator reliability and weight samples for training large language models, achieving a macro-F1 of 0.757 with Llama-3.2-1B and 0.740 with TwHIN-BERT-large on a new dataset.

Misinformation spreads rapidly on social media, confusing the truth and targeting potentially vulnerable people. To effectively mitigate the negative impact of misinformation, it must first be accurately detected before applying a mitigation strategy, such as X's community notes, which is currently a manual process. This study takes a knowledge-based approach to misinformation detection, modelling the problem similarly to one of natural language inference. The EffiARA annotation framework is introduced, aiming to utilise inter- and intra-annotator agreement to understand the reliability of each annotator and influence the training of large language models for classification based on annotator reliability. In assessing the EffiARA annotation framework, the Russo-Ukrainian Conflict Knowledge-Based Misinformation Classification Dataset (RUC-MCD) was developed and made publicly available. This study finds that sample weighting using annotator reliability performs the best, utilising both inter- and intra-annotator agreement and soft-label training. The highest classification performance achieved using Llama-3.2-1B was a macro-F1 of 0.757 and 0.740 using TwHIN-BERT-large.

View on arXiv PDF Code

Similar