CLDec 19, 2020

Lexically-constrained Text Generation through Commonsense Knowledge Extraction and Injection

Yikang Li, Pulkit Goel, Varsha Kuppur Rajendra, Har Simrat Singh, Jonathan Francis, Kaixin Ma, Eric Nyberg, Alessandro Oltramari

arXiv:2012.10813v11.05 citations

Originality Incremental advance

AI Analysis

This work aims to improve the commonsense reasoning and lexical constraint adherence of text generation models for researchers working on conditional text generation, representing an incremental improvement.

This paper addresses the challenge of generating lexically-constrained text that aligns with human commonsense, specifically on the Commongen benchmark. The authors propose extracting commonsense relations from Conceptnet, injecting them into a Unified Language Model via attention, and enforcing lexical requirements through output constraints, resulting in generated sentences that are more aligned with human understanding and compliant with lexical requirements.

Conditional text generation has been a challenging task that is yet to see human-level performance from state-of-the-art models. In this work, we specifically focus on the Commongen benchmark, wherein the aim is to generate a plausible sentence for a given set of input concepts. Despite advances in other tasks, large pre-trained language models that are fine-tuned on this dataset often produce sentences that are syntactically correct but qualitatively deviate from a human understanding of common sense. Furthermore, generated sequences are unable to fulfill such lexical requirements as matching part-of-speech and full concept coverage. In this paper, we explore how commonsense knowledge graphs can enhance model performance, with respect to commonsense reasoning and lexically-constrained decoding. We propose strategies for enhancing the semantic correctness of the generated text, which we accomplish through: extracting commonsense relations from Conceptnet, injecting these relations into the Unified Language Model (UniLM) through attention mechanisms, and enforcing the aforementioned lexical requirements through output constraints. By performing several ablations, we find that commonsense injection enables the generation of sentences that are more aligned with human understanding, while remaining compliant with lexical requirements.

View on arXiv PDF

Similar