LGMar 15, 2021

How Many Data Points is a Prompt Worth?

arXiv:2103.08493v2837 citations
AI Analysis

This work addresses the problem of efficient model adaptation for researchers and practitioners in low-data scenarios, though it is incremental in quantifying an existing claim.

The study quantified the benefit of prompting over head-based fine-tuning in low-data regimes, finding that prompting provides an advantage equivalent to hundreds of data points on average across classification tasks.

When fine-tuning pretrained models for classification, researchers either use a generic model head or a task-specific prompt for prediction. Proponents of prompting have argued that prompts provide a method for injecting task-specific guidance, which is beneficial in low-data regimes. We aim to quantify this benefit through rigorous testing of prompts in a fair setting: comparing prompted and head-based fine-tuning in equal conditions across many tasks and data sizes. By controlling for many sources of advantage, we find that prompting does indeed provide a benefit, and that this benefit can be quantified per task. Results show that prompting is often worth 100s of data points on average across classification tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes