CLAIMar 31, 2024

On the True Distribution Approximation of Minimum Bayes-Risk Decoding

arXiv:2404.00752v132 citationsh-index: 13NAACL
Originality Synthesis-oriented
AI Analysis

This work addresses a theoretical gap in MBR decoding for text generation, but it is incremental as it focuses on measuring approximation rather than proposing a new method.

The paper tackles the problem of performance variation in Minimum Bayes-Risk (MBR) decoding for text generation, which depends on how well sampled texts approximate the true reference distribution, and shows that anomaly detection scores correlate with this variation, empirically supporting the link between performance and MBR's core assumption.

Minimum Bayes-risk (MBR) decoding has recently gained renewed attention in text generation. MBR decoding considers texts sampled from a model as pseudo-references and selects the text with the highest similarity to the others. Therefore, sampling is one of the key elements of MBR decoding, and previous studies reported that the performance varies by sampling methods. From a theoretical standpoint, this performance variation is likely tied to how closely the samples approximate the true distribution of references. However, this approximation has not been the subject of in-depth study. In this study, we propose using anomaly detection to measure the degree of approximation. We first closely examine the performance variation and then show that previous hypotheses about samples do not correlate well with the variation, but our introduced anomaly scores do. The results are the first to empirically support the link between the performance and the core assumption of MBR decoding.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes