CLMay 19, 2022

Are Prompt-based Models Clueless?

arXiv:2205.09295v2645 citationsh-index: 15
AI Analysis

This reveals a limitation in prompt-based models for natural language understanding, showing they are not immune to dataset biases, which is incremental as it extends prior findings from finetuned models.

The paper investigates whether few-shot prompt-based models exploit superficial cues in datasets, finding that they do, performing well on cued instances but poorly on those without cues.

Finetuning large pre-trained language models with a task-specific head has advanced the state-of-the-art on many natural language understanding benchmarks. However, models with a task-specific head require a lot of training data, making them susceptible to learning and exploiting dataset-specific superficial cues that do not generalize to other datasets. Prompting has reduced the data requirement by reusing the language model head and formatting the task input to match the pre-training objective. Therefore, it is expected that few-shot prompt-based models do not exploit superficial cues. This paper presents an empirical examination of whether few-shot prompt-based models also exploit superficial cues. Analyzing few-shot prompt-based models on MNLI, SNLI, HANS, and COPA has revealed that prompt-based models also exploit superficial cues. While the models perform well on instances with superficial cues, they often underperform or only marginally outperform random accuracy on instances without superficial cues.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes