CLLGNEMLJun 30, 2020

Technical Report: Auxiliary Tuning and its Application to Conditional Text Generation

arXiv:2006.16823v122 citations
Originality Incremental advance
AI Analysis

This is an incremental improvement for researchers and practitioners in NLP, offering a more efficient method for task adaptation.

The paper tackles adapting pre-trained language models to new tasks, specifically conditional text generation, by introducing Auxiliary Tuning, which supplements the model with an auxiliary component to shift output distributions, achieving similar results to training from scratch with significantly fewer resources.

We introduce a simple and efficient method, called Auxiliary Tuning, for adapting a pre-trained Language Model to a novel task; we demonstrate this approach on the task of conditional text generation. Our approach supplements the original pre-trained model with an auxiliary model that shifts the output distribution according to the target task. The auxiliary model is trained by adding its logits to the pre-trained model logits and maximizing the likelihood of the target task output. Our method imposes no constraints on the auxiliary architecture. In particular, the auxiliary model can ingest additional input relevant to the target task, independently from the pre-trained model's input. Furthermore, mixing the models at the logits level provides a natural probabilistic interpretation of the method. Our method achieved similar results to training from scratch for several different tasks, while using significantly fewer resources for training; we share a specific example of text generation conditioned on keywords.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes