CLAIJan 21, 2025

LuxVeri at GenAI Detection Task 1: Inverse Perplexity Weighted Ensemble for Robust Detection of AI-Generated Text across English and Multilingual Contexts

arXiv:2501.11914v120 citationsh-index: 2COLING Workshops
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of robust AI-generated text detection for applications in content moderation and verification, though it is incremental as it builds on existing ensemble techniques.

The paper tackled the problem of detecting AI-generated text in English and multilingual contexts by using an inverse perplexity weighted ensemble of models, achieving Macro F1-scores of 0.7458 (ranking 12th out of 35 teams) for English and 0.7513 (ranking 4th out of 25 teams) for multilingual detection.

This paper presents a system developed for Task 1 of the COLING 2025 Workshop on Detecting AI-Generated Content, focusing on the binary classification of machine-generated versus human-written text. Our approach utilizes an ensemble of models, with weights assigned according to each model's inverse perplexity, to enhance classification accuracy. For the English text detection task, we combined RoBERTa-base, RoBERTa-base with the OpenAI detector, and BERT-base-cased, achieving a Macro F1-score of 0.7458, which ranked us 12th out of 35 teams. We ensembled RemBERT, XLM-RoBERTa-base, and BERT-base-multilingual-case for the multilingual text detection task, employing the same inverse perplexity weighting technique. This resulted in a Macro F1-score of 0.7513, positioning us 4th out of 25 teams. Our results demonstrate the effectiveness of inverse perplexity weighting in improving the robustness of machine-generated text detection across both monolingual and multilingual settings, highlighting the potential of ensemble methods for this challenging task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes