CL AIJun 21, 2024

Assessing Good, Bad and Ugly Arguments Generated by ChatGPT: a New Dataset, its Methodology and Associated Tasks

Victor Hugo Nascimento Rocha, Igor Cataneo Silveira, Paulo Pirozelli, Denis Deratani Mauá, Fabio Gagliardi Cozman

arXiv:2406.15130v13.46 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This addresses the need for tools to identify fake arguments from LLMs, but is incremental as it focuses on dataset creation and baseline tasks.

The paper tackles the problem of misinformation from Large Language Models by creating a dataset of good, bad, and ugly arguments generated by ChatGPT, and shows that this artificially generated data relates well to human argumentation for training and testing systems.

The recent success of Large Language Models (LLMs) has sparked concerns about their potential to spread misinformation. As a result, there is a pressing need for tools to identify ``fake arguments'' generated by such models. To create these tools, examples of texts generated by LLMs are needed. This paper introduces a methodology to obtain good, bad and ugly arguments from argumentative essays produced by ChatGPT, OpenAI's LLM. We then describe a novel dataset containing a set of diverse arguments, ArGPT. We assess the effectiveness of our dataset and establish baselines for several argumentation-related tasks. Finally, we show that the artificially generated data relates well to human argumentation and thus is useful as a tool to train and test systems for the defined tasks.

View on arXiv PDF Code

Similar