LGOct 29, 2024

Online Detection of LLM-Generated Texts via Sequential Hypothesis Testing by Betting

arXiv:2410.22318v310 citationsh-index: 2ICML
Originality Incremental advance
AI Analysis

This addresses the need for media and online platforms to quickly identify LLM-generated content to prevent misinformation, offering an online extension to existing offline detection methods.

The paper tackles the problem of detecting LLM-generated texts in streaming online content, developing a sequential hypothesis testing algorithm that provides statistical guarantees on false positive rate and detection time.

Developing algorithms to differentiate between machine-generated texts and human-written texts has garnered substantial attention in recent years. Existing methods in this direction typically concern an offline setting where a dataset containing a mix of real and machine-generated texts is given upfront, and the task is to determine whether each sample in the dataset is from a large language model (LLM) or a human. However, in many practical scenarios, sources such as news websites, social media accounts, and online forums publish content in a streaming fashion. Therefore, in this online scenario, how to quickly and accurately determine whether the source is an LLM with strong statistical guarantees is crucial for these media or platforms to function effectively and prevent the spread of misinformation and other potential misuse of LLMs. To tackle the problem of online detection, we develop an algorithm based on the techniques of sequential hypothesis testing by betting that not only builds upon and complements existing offline detection techniques but also enjoys statistical guarantees, which include a controlled false positive rate and the expected time to correctly identify a source as an LLM. Experiments were conducted to demonstrate the effectiveness of our method.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes