HCAIAug 14, 2023

Detecting The Corruption Of Online Questionnaires By Artificial Intelligence

arXiv:2308.07499v124 citationsh-index: 48
Originality Synthesis-oriented
AI Analysis

This addresses a critical data integrity issue for researchers using crowd-sourcing platforms, but it is incremental as it highlights an existing problem without proposing a new solution.

The study tackled the problem of AI-generated responses corrupting online questionnaires, finding that humans could detect AI authorship with 76% accuracy but automatic detection systems were completely unusable, threatening data quality.

Online questionnaires that use crowd-sourcing platforms to recruit participants have become commonplace, due to their ease of use and low costs. Artificial Intelligence (AI) based Large Language Models (LLM) have made it easy for bad actors to automatically fill in online forms, including generating meaningful text for open-ended tasks. These technological advances threaten the data quality for studies that use online questionnaires. This study tested if text generated by an AI for the purpose of an online study can be detected by both humans and automatic AI detection systems. While humans were able to correctly identify authorship of text above chance level (76 percent accuracy), their performance was still below what would be required to ensure satisfactory data quality. Researchers currently have to rely on the disinterest of bad actors to successfully use open-ended responses as a useful tool for ensuring data quality. Automatic AI detection systems are currently completely unusable. If AIs become too prevalent in submitting responses then the costs associated with detecting fraudulent submissions will outweigh the benefits of online questionnaires. Individual attention checks will no longer be a sufficient tool to ensure good data quality. This problem can only be systematically addressed by crowd-sourcing platforms. They cannot rely on automatic AI detection systems and it is unclear how they can ensure data quality for their paying clients.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes