CLHCJan 20, 2024

How the Advent of Ubiquitous Large Language Models both Stymie and Turbocharge Dynamic Adversarial Question Generation

arXiv:2401.11185v11 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of generating effective adversarial questions for LLMs, which is incremental as it builds on existing methods with new guidance and metrics.

The study examined how large language models (LLMs) both hinder and aid dynamic adversarial question generation, finding that authors could create challenging questions but often produced poor ones; they proposed new metrics and incentives, resulting in a new dataset.

Dynamic adversarial question generation, where humans write examples to stump a model, aims to create examples that are realistic and informative. However, the advent of large language models (LLMs) has been a double-edged sword for human authors: more people are interested in seeing and pushing the limits of these models, but because the models are so much stronger an opponent, they are harder to defeat. To understand how these models impact adversarial question writing process, we enrich the writing guidance with LLMs and retrieval models for the authors to reason why their questions are not adversarial. While authors could create interesting, challenging adversarial questions, they sometimes resort to tricks that result in poor questions that are ambiguous, subjective, or confusing not just to a computer but also to humans. To address these issues, we propose new metrics and incentives for eliciting good, challenging questions and present a new dataset of adversarially authored questions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes