CLFeb 23, 2025

All That Glitters is Not Novel: Plagiarism in AI Generated Research

arXiv:2502.16487v319 citationsh-index: 11ACL
Originality Incremental advance
AI Analysis

This highlights a critical issue for academic publishing and AI ethics, as it exposes vulnerabilities in assessing novelty in automated research systems.

The paper tackles the problem of plagiarism in AI-generated research documents, finding that 24% of evaluated documents were plagiarized from existing work and automated detectors are inadequate at catching such plagiarism.

Automating scientific research is considered the final frontier of science. Recently, several papers claim autonomous research agents can generate novel research ideas. Amidst the prevailing optimism, we document a critical concern: a considerable fraction of such research documents are smartly plagiarized. Unlike past efforts where experts evaluate the novelty and feasibility of research ideas, we request $13$ experts to operate under a different situational logic: to identify similarities between LLM-generated research documents and existing work. Concerningly, the experts identify $24\%$ of the $50$ evaluated research documents to be either paraphrased (with one-to-one methodological mapping), or significantly borrowed from existing work. These reported instances are cross-verified by authors of the source papers. The remaining $76\%$ of documents show varying degrees of similarity with existing work, with only a small fraction appearing completely novel. Problematically, these LLM-generated research documents do not acknowledge original sources, and bypass inbuilt plagiarism detectors. Lastly, through controlled experiments we show that automated plagiarism detectors are inadequate at catching plagiarized ideas from such systems. We recommend a careful assessment of LLM-generated research, and discuss the implications of our findings on academic publishing.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes