CLDec 14, 2021

Massive-scale Decoding for Text Generation using Lattices

arXiv:2112.07660v2627 citations
AI Analysis

This provides a foundation for downstream applications needing diverse text generation, but it is incremental as it builds on existing decoding methods.

The paper tackles the problem of generating diverse text outputs by introducing a search algorithm that constructs lattices encoding thousands of high-quality and grammatical options, improving efficiency through best-first search and hypothesis recombination.

Conditional neural text generation models generate high-quality outputs, but often concentrate around a mode when what we really want is a diverse set of options. We present a search algorithm to construct lattices encoding a massive number of generation options. First, we restructure decoding as a best-first search, which explores the space differently than beam search and improves efficiency by avoiding pruning paths. Second, we revisit the idea of hypothesis recombination: we can identify pairs of similar generation candidates during search and merge them as an approximation. On both summarization and machine translation, we show that our algorithm encodes thousands of diverse options that remain grammatical and high-quality into one lattice. This algorithm provides a foundation for building downstream generation applications on top of massive-scale diverse outputs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes