CLOct 16, 2020

Reflective Decoding: Beyond Unidirectional Generation with Off-the-Shelf Language Models

arXiv:2010.08566v4721 citations
Originality Highly original
AI Analysis

This addresses the limitation of off-the-shelf language models for tasks that break unidirectional assumptions, offering an unsupervised solution that narrows the gap with supervised approaches.

The paper tackled the problem of applying unidirectional language models to non-sequential tasks like paraphrasing and text-infilling without supervision, and the result was that Reflective Decoding outperformed strong unsupervised baselines and even surpassed some supervised methods on various metrics.

Publicly available, large pretrained LanguageModels (LMs) generate text with remarkable quality, but only sequentially from left to right. As a result, they are not immediately applicable to generation tasks that break the unidirectional assumption, such as paraphrasing or text-infilling, necessitating task-specific supervision. In this paper, we present Reflective Decoding, a novel unsupervised algorithm that allows for direct application of unidirectional LMs to non-sequential tasks. Our 2-step approach requires no supervision or even parallel corpora, only two off-the-shelf pretrained LMs in opposite directions: forward and backward. First, in the contextualization step, we use LMs to generate ensembles of past and future contexts which collectively capture the input (e.g. the source sentence for paraphrasing). Second, in the reflection step, we condition on these "context ensembles", generating outputs that are compatible with them. Comprehensive empirical results demonstrate that Reflective Decoding outperforms strong unsupervised baselines on both paraphrasing and abductive text infilling, significantly narrowing the gap between unsupervised and supervised methods. Reflective Decoding surpasses multiple supervised baselines on various metrics including human evaluation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes