CLOct 18, 2023

Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly

Sheng Lu, Shan Chen, Yingya Li, Danielle Bitterman, Guergana Savova, Iryna Gurevych

Harvard

arXiv:2310.12300v221.1132 citationsh-index: 59Has Code

Originality Incremental advance

AI Analysis

This provides a more efficient tool for analyzing in-context learning in large language models, but it is incremental as it adapts an existing metric.

The paper tackled the problem of efficiently measuring hardness in in-context learning by adapting pointwise V-usable information to an in-context version, showing it is stable across exemplar selections and shot numbers without requiring fine-tuning.

In-context learning (ICL) is a new learning paradigm that has gained popularity along with the development of large language models. In this work, we adapt a recently proposed hardness metric, pointwise $\mathcal{V}$-usable information (PVI), to an in-context version (in-context PVI). Compared to the original PVI, in-context PVI is more efficient in that it requires only a few exemplars and does not require fine-tuning. We conducted a comprehensive empirical analysis to evaluate the reliability of in-context PVI. Our findings indicate that in-context PVI estimates exhibit similar characteristics to the original PVI. Specific to the in-context setting, we show that in-context PVI estimates remain consistent across different exemplar selections and numbers of shots. The variance of in-context PVI estimates across different exemplar selections is insignificant, which suggests that in-context PVI are stable. Furthermore, we demonstrate how in-context PVI can be employed to identify challenging instances. Our work highlights the potential of in-context PVI and provides new insights into the capabilities of ICL.

View on arXiv PDF Code

Similar