CLOct 18, 2023

Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly

Harvard
arXiv:2310.12300v2132 citationsh-index: 59
Originality Incremental advance
AI Analysis

This provides a more efficient tool for analyzing in-context learning in large language models, but it is incremental as it adapts an existing metric.

The paper tackled the problem of efficiently measuring hardness in in-context learning by adapting pointwise V-usable information to an in-context version, showing it is stable across exemplar selections and shot numbers without requiring fine-tuning.

In-context learning (ICL) is a new learning paradigm that has gained popularity along with the development of large language models. In this work, we adapt a recently proposed hardness metric, pointwise $\mathcal{V}$-usable information (PVI), to an in-context version (in-context PVI). Compared to the original PVI, in-context PVI is more efficient in that it requires only a few exemplars and does not require fine-tuning. We conducted a comprehensive empirical analysis to evaluate the reliability of in-context PVI. Our findings indicate that in-context PVI estimates exhibit similar characteristics to the original PVI. Specific to the in-context setting, we show that in-context PVI estimates remain consistent across different exemplar selections and numbers of shots. The variance of in-context PVI estimates across different exemplar selections is insignificant, which suggests that in-context PVI are stable. Furthermore, we demonstrate how in-context PVI can be employed to identify challenging instances. Our work highlights the potential of in-context PVI and provides new insights into the capabilities of ICL.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes