CL · AI · Oct 31, 2024

DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios

arXiv:2410.23746v3 · 41 citations · h-index: 31 · Has Code · NIPS
Originality: Synthesis-oriented
AI Analysis

This paper addresses the reliability of LLM-generated text detection in applications prone to LLM misuse, such as academic and content domains. Its contribution is incremental: it focuses on benchmarking existing detectors rather than proposing a new detection method.

The study tackled the problem of detecting LLM-generated text in real-world scenarios by creating a new benchmark, DetectRL, which showed that state-of-the-art detectors underperformed, with performance dropping significantly under adversarial conditions.

Detecting text generated by large language models (LLMs) is of great recent interest. With zero-shot methods like DetectGPT, detection capabilities have reached impressive levels. However, the reliability of existing detectors in real-world applications remains underexplored. In this study, we present a new benchmark, DetectRL, highlighting that even state-of-the-art (SOTA) detection techniques still underperformed in this task. We collected human-written datasets from domains where LLMs are particularly prone to misuse. Using popular LLMs, we generated data that better aligns with real-world applications. Unlike previous studies, we employed heuristic rules to create adversarial LLM-generated text, simulating various prompt usages, human revisions like word substitutions, and writing noise like spelling mistakes. Our development of DetectRL reveals the strengths and limitations of current SOTA detectors. More importantly, we analyzed the potential impact of writing styles, model types, attack methods, text lengths, and real-world human writing factors on different types of detectors. We believe DetectRL could serve as an effective benchmark for assessing detectors in real-world scenarios, evolving with advanced attack methods, thus providing more stressful evaluation to drive the development of more efficient detectors. Data and code are publicly available at: https://github.com/NLP2CT/DetectRL.
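To make the idea of heuristic perturbations concrete, here is a minimal Python sketch of two attacks of the kind the abstract names: spelling noise and word substitution. This is an illustration only, not the DetectRL implementation. The function names (`swap_adjacent_chars`, `add_spelling_noise`, `substitute_words`), the `rate` and `seed` parameters, and the choice of adjacent-character swaps as the typo model are all assumptions made for this example.

```python
import random


def swap_adjacent_chars(word: str, rng: random.Random) -> str:
    """Swap two neighbouring inner characters to mimic a typing mistake."""
    if len(word) < 4:
        return word  # too short to perturb while keeping first/last characters
    i = rng.randrange(1, len(word) - 2)  # first and last characters stay fixed
    return word[:i] + word[i + 1] + word[i] + word[i + 2:]


def add_spelling_noise(text: str, rate: float = 0.1, seed: int = 0) -> str:
    """Perturb roughly `rate` of the whitespace-separated words (hypothetical heuristic)."""
    rng = random.Random(seed)  # seeded so the perturbation is reproducible
    words = text.split()
    noisy = [swap_adjacent_chars(w, rng) if rng.random() < rate else w for w in words]
    return " ".join(noisy)


def substitute_words(text: str, synonyms: dict[str, str]) -> str:
    """Replace words found in a caller-supplied synonym table (assumed input)."""
    return " ".join(synonyms.get(w.lower(), w) for w in text.split())
```

A detector could then be scored on the original and the perturbed text to see how much its output shifts. The perturbation rate and the synonym table would need to be chosen per experiment; the values here are placeholders.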

Code Implementations: 1 repo
