CLLGOct 15, 2021

Meta-learning via Language Model In-context Tuning

arXiv:2110.07814v2661 citations
Originality Incremental advance
AI Analysis

This addresses the problem of few-shot learning for NLP practitioners by offering a more efficient and stable method, though it is incremental as it builds on existing in-context learning and fine-tuning techniques.

The paper tackles meta-learning in NLP by proposing in-context tuning, which frames adaptation as sequence prediction and fine-tunes a pre-trained language model on task collections, resulting in a 6% absolute AUC ROC improvement over MAML and a 10% improvement over non-fine-tuned in-context learning on BinaryClfs.

The goal of meta-learning is to learn to adapt to a new task with only a few labeled examples. To tackle this problem in NLP, we propose $\textit{in-context tuning}$, which recasts adaptation and prediction as a simple sequence prediction problem: to form the input sequence, we concatenate the task instruction, the labeled examples, and the target input to predict; to meta-train the model to learn from in-context examples, we fine-tune a pre-trained language model (LM) to predict the target label from the input sequences on a collection of tasks. We benchmark our method on two collections of text classification tasks: LAMA and BinaryClfs. Compared to first-order MAML which adapts the model with gradient descent, our method better leverages the inductive bias of LMs to perform pattern matching, and outperforms MAML by an absolute $6\%$ AUC ROC score on BinaryClfs, with increasing advantage w.r.t. model size. Compared to non-fine-tuned in-context learning (i.e. prompting a raw LM), in-context tuning directly learns to learn from in-context examples. On BinaryClfs, in-context tuning improves the average AUC-ROC score by an absolute $10\%$, and reduces the variance with respect to example ordering by 6x and example choices by 2x.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes