CLAILGOct 13, 2022

Predicting Fine-Tuning Performance with Probing

U of Toronto
arXiv:2210.07352v1296 citationsh-index: 43
Originality Incremental advance
AI Analysis

This provides a lightweight method for model developers to estimate performance without full fine-tuning, though it is incremental in linking probing to practical applications.

The paper tackled the problem of predicting fine-tuning performance in large NLP models by using probing test accuracies, achieving errors 40% to 80% smaller than baselines.

Large NLP models have recently shown impressive performance in language understanding tasks, typically evaluated by their fine-tuned performance. Alternatively, probing has received increasing attention as being a lightweight method for interpreting the intrinsic mechanisms of large NLP models. In probing, post-hoc classifiers are trained on "out-of-domain" datasets that diagnose specific abilities. While probing the language models has led to insightful findings, they appear disjointed from the development of models. This paper explores the utility of probing deep NLP models to extract a proxy signal widely used in model development -- the fine-tuning performance. We find that it is possible to use the accuracies of only three probing tests to predict the fine-tuning performance with errors $40\%$ - $80\%$ smaller than baselines. We further discuss possible avenues where probing can empower the development of deep NLP models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes