CLAILGSep 16, 2022

On the Relation between Sensitivity and Accuracy in In-context Learning

arXiv:2209.07661v3175 citationsh-index: 76
Originality Incremental advance
AI Analysis

This addresses the unreliability of ICL for real-world applications by improving robustness, though it is incremental as it builds on existing sensitivity analysis.

The paper tackles the problem of in-context learning (ICL) being oversensitive to prompts, which reduces reliability, by finding that prior work underestimated sensitivity due to label bias and showing a negative correlation between sensitivity and accuracy. It proposes SenSel, a selective prediction method that abstains from sensitive predictions, outperforming baselines on ten datasets.

In-context learning (ICL) suffers from oversensitivity to the prompt, making it unreliable in real-world scenarios. We study the sensitivity of ICL with respect to multiple perturbation types. First, we find that label bias obscures the true sensitivity, and therefore prior work may have significantly underestimated ICL sensitivity. Second, we observe a strong negative correlation between ICL sensitivity and accuracy: predictions sensitive to perturbations are less likely to be correct. Motivated by these findings, we propose \textsc{SenSel}, a few-shot selective prediction method that abstains from sensitive predictions. Experiments on ten classification datasets show that \textsc{SenSel} consistently outperforms two commonly used confidence-based and entropy-based baselines on abstention decisions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes