CLNov 6, 2022

Suffix Retrieval-Augmented Language Modeling

arXiv:2211.03053v22 citationsh-index: 15
AI Analysis

This addresses the challenge of improving language model performance for sequence generation tasks, particularly in spoken dialogue, though it appears incremental as it builds on existing retrieval-augmented methods.

The paper tackled the problem of incorporating bi-directional context into autoregressive language models for sequence generation, proposing SUREALM which uses suffix retrieval to simulate future context, resulting in promising word perplexity reduction on the DSTC9 spoken dialogue corpus.

Causal language modeling (LM) uses word history to predict the next word. BERT, on the other hand, makes use of bi-directional word information in a sentence to predict words at masked positions. While BERT is effective in sequence encoding, it is non-causal by nature and is not designed for sequence generation. In this paper, we propose a novel language model, SUffix REtrieval-Augmented LM (SUREALM), that simulates a bi-directional contextual effect in an autoregressive manner. SUREALM employs an embedding retriever to search for training sentences in a data store that share similar word history during sequence generation. In particular, the suffix portions of the retrieved sentences mimick the "future" context. We evaluated our proposed model on the DSTC9 spoken dialogue corpus and showed promising word perplexity reduction on the validation and test set compared to competitive baselines.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes