CLDec 22, 2023

Efficacy of Machine-Generated Instructions

arXiv:2312.14423v1h-index: 1
Originality Incremental advance
AI Analysis

This addresses the resource and cost constraints in fine-tuning models for NLP tasks, but it is incremental as it builds on existing methods.

The study tackled the problem of limited human-written instruction data for fine-tuning language models by evaluating machine-generated annotations, finding that they achieved 78.54% correctness and the fine-tuned model reached 96.01% of the performance with human annotations.

Large "instruction-tuned" language models (i.e., finetuned to respond to instructions) have demonstrated a remarkable ability to generalize zero-shot to new tasks. Nevertheless, they depend heavily on human-written instruction data that is often limited in quantity, diversity, and creativity, therefore hindering the generality of the tuned model. We conducted a quantitative study to figure out the efficacy of machine-generated annotations, where we compare the results of a fine-tuned BERT model with human v/s machine-generated annotations. Applying our methods to the vanilla GPT-3 model, we saw that machine generated annotations were 78.54% correct and the fine-tuned model achieved a 96.01% model performance compared to the performance with human-labelled annotations. This result shows that machine-generated annotations are a resource and cost effective way to fine-tune down-stream models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes