CLJun 24, 2024

PlagBench: Exploring the Duality of Large Language Models in Plagiarism Generation and Detection

arXiv:2406.16288v212 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses academic integrity and copyright protection concerns by exploring LLMs' capabilities in both creating and identifying plagiarism, though it is incremental in building on existing datasets and methods.

The paper tackles the dual role of large language models (LLMs) in generating and detecting plagiarism by introducing PlagBench, a dataset of 46.5K synthetic text pairs, and finds that GPT-4 outperforms other models and tools by 20% in detection.

Recent studies have raised concerns about the potential threats large language models (LLMs) pose to academic integrity and copyright protection. Yet, their investigation is predominantly focused on literal copies of original texts. Also, how LLMs can facilitate the detection of LLM-generated plagiarism remains largely unexplored. To address these gaps, we introduce \textbf{\sf PlagBench}, a dataset of 46.5K synthetic text pairs that represent three major types of plagiarism: verbatim copying, paraphrasing, and summarization. These samples are generated by three advanced LLMs. We rigorously validate the quality of PlagBench through a combination of fine-grained automatic evaluation and human annotation. We then utilize this dataset for two purposes: (1) to examine LLMs' ability to transform original content into accurate paraphrases and summaries, and (2) to evaluate the plagiarism detection performance of five modern LLMs alongside three specialized plagiarism checkers. Our results show that GPT-3.5 Turbo can produce high-quality paraphrases and summaries without significantly increasing text complexity compared to GPT-4 Turbo. However, in terms of detection, GPT-4 outperforms other LLMs and commercial detection tools by 20%, highlights the evolving capabilities of LLMs not only in content generation but also in plagiarism detection. Data and source code are available at https://github.com/Brit7777/plagbench.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes