CLLGSep 22, 2021

Conditional Poisson Stochastic Beam Search

arXiv:2109.11034v313 citations
Originality Incremental advance
AI Analysis

This work addresses bias in decoding for NLP tasks, offering a more effective stochastic alternative to beam search.

The paper tackles the problem of high overlap and bias in beam search for sequence generation by introducing Conditional Poisson stochastic beam search, which samples candidates without replacement to produce lower variance and more efficient estimators than previous stochastic methods.

Beam search is the default decoding strategy for many sequence generation tasks in NLP. The set of approximate K-best items returned by the algorithm is a useful summary of the distribution for many applications; however, the candidates typically exhibit high overlap and may give a highly biased estimate for expectations under our model. These problems can be addressed by instead using stochastic decoding strategies. In this work, we propose a new method for turning beam search into a stochastic process: Conditional Poisson stochastic beam search. Rather than taking the maximizing set at each iteration, we sample K candidates without replacement according to the conditional Poisson sampling design. We view this as a more natural alternative to Kool et. al. 2019's stochastic beam search (SBS). Furthermore, we show how samples generated under the CPSBS design can be used to build consistent estimators and sample diverse sets from sequence models. In our experiments, we observe CPSBS produces lower variance and more efficient estimators than SBS, even showing improvements in high entropy settings.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes