CLAug 13, 2025

AINL-Eval 2025 Shared Task: Detection of AI-Generated Scientific Abstracts in Russian

arXiv:2508.09622v1h-index: 3Has Code
Originality Synthesis-oriented
AI Analysis

This addresses the problem of maintaining academic integrity in scientific publishing, particularly in multilingual contexts, by providing a benchmark for AI-generated text detection, though it is incremental as it builds on existing detection efforts.

The paper introduces the AINL-Eval 2025 Shared Task, tackling the problem of detecting AI-generated scientific abstracts in Russian to address academic integrity challenges, with top systems showing strong performance in identifying such content.

The rapid advancement of large language models (LLMs) has revolutionized text generation, making it increasingly difficult to distinguish between human- and AI-generated content. This poses a significant challenge to academic integrity, particularly in scientific publishing and multilingual contexts where detection resources are often limited. To address this critical gap, we introduce the AINL-Eval 2025 Shared Task, specifically focused on the detection of AI-generated scientific abstracts in Russian. We present a novel, large-scale dataset comprising 52,305 samples, including human-written abstracts across 12 diverse scientific domains and AI-generated counterparts from five state-of-the-art LLMs (GPT-4-Turbo, Gemma2-27B, Llama3.3-70B, Deepseek-V3, and GigaChat-Lite). A core objective of the task is to challenge participants to develop robust solutions capable of generalizing to both (i) previously unseen scientific domains and (ii) models not included in the training data. The task was organized in two phases, attracting 10 teams and 159 submissions, with top systems demonstrating strong performance in identifying AI-generated content. We also establish a continuous shared task platform to foster ongoing research and long-term progress in this important area. The dataset and platform are publicly available at https://github.com/iis-research-team/AINL-Eval-2025.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes