CLOct 12, 2021

DiscoDVT: Generating Long Text with Discourse-Aware Discrete Variational Transformer

arXiv:2110.05999v1663 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the challenge of maintaining coherence in long text generation for natural language processing applications, representing an incremental improvement over existing methods.

The paper tackles the problem of generating long, coherent text by proposing DiscoDVT, a discourse-aware discrete variational Transformer that learns discrete variables to guide generation, resulting in improved long-range coherence on story generation datasets.

Despite the recent advances in applying pre-trained language models to generate high-quality texts, generating long passages that maintain long-range coherence is yet challenging for these models. In this paper, we propose DiscoDVT, a discourse-aware discrete variational Transformer to tackle the incoherence issue. DiscoDVT learns a discrete variable sequence that summarizes the global structure of the text and then applies it to guide the generation process at each decoding step. To further embed discourse-aware information into the discrete latent representations, we introduce an auxiliary objective to model the discourse relations within the text. We conduct extensive experiments on two open story generation datasets and demonstrate that the latent codes learn meaningful correspondence to the discourse structures that guide the model to generate long texts with better long-range coherence.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes