Transformers and Ensemble methods: A solution for Hate Speech Detection in Arabic languages
This work addresses hate speech detection for Arabic language users, but it is incremental as it applies existing methods to a specific dataset.
The paper tackled hate speech detection in Arabic languages by evaluating transformer models and ensemble methods, achieving an F1-score of 0.60 and accuracy of 0.86 on the test set.
This paper describes our participation in the shared task of hate speech detection, which is one of the subtasks of the CERIST NLP Challenge 2022. Our experiments evaluate the performance of six transformer models and their combination using 2 ensemble approaches. The best results on the training set, in a five-fold cross validation scenario, were obtained by using the ensemble approach based on the majority vote. The evaluation of this approach on the test set resulted in an F1-score of 0.60 and an Accuracy of 0.86.