CLLGApr 16, 2021

An Empirical Study of Extrapolation in Text Generation with Scalar Control

arXiv:2104.07910v11 citations
AI Analysis

This addresses the problem of model generalization for researchers and practitioners in controllable text generation, but it is incremental as it compares existing embedding methods.

The study evaluated how well text generation models extrapolate to unseen scalar control values like length or sentiment, finding that using raw scalar inputs without encoding performed most reliably.

We conduct an empirical evaluation of extrapolation performance when conditioning on scalar control inputs like desired output length, desired edit from an input sentence, and desired sentiment across three text generation tasks. Specifically, we examine a zero-shot setting where models are asked to generalize to ranges of control values not seen during training. We focus on evaluating popular embedding methods for scalar inputs, including both learnable and sinusoidal embeddings, as well as simpler approaches. Surprisingly, our findings indicate that the simplest strategy of using scalar inputs directly, without further encoding, most reliably allows for successful extrapolation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes