CL AIJan 7, 2025

IntegrityAI at GenAI Detection Task 2: Detecting Machine-Generated Academic Essays in English and Arabic Using ELECTRA and Stylometry

arXiv:2501.05476v121.320 citationsh-index: 2COLING Workshops

Originality Synthesis-oriented

AI Analysis

This addresses the problem of academic integrity by detecting AI-generated essays, though it is incremental as it applies existing methods to new data.

The paper tackled detecting machine-generated academic essays in English and Arabic by fine-tuning ELECTRA models with stylometric features, achieving F1-scores of 99.7% (2nd out of 26 teams) for English and 98.4% (1st out of 23 teams) for Arabic.

Recent research has investigated the problem of detecting machine-generated essays for academic purposes. To address this challenge, this research utilizes pre-trained, transformer-based models fine-tuned on Arabic and English academic essays with stylometric features. Custom models based on ELECTRA for English and AraELECTRA for Arabic were trained and evaluated using a benchmark dataset. Proposed models achieved excellent results with an F1-score of 99.7%, ranking 2nd among of 26 teams in the English subtask, and 98.4%, finishing 1st out of 23 teams in the Arabic one.

View on arXiv PDF

Similar