CLSep 14, 2021

KFCNet: Knowledge Filtering and Contrastive Learning Network for Generative Commonsense Reasoning

arXiv:2109.06704v13 citations
Originality Incremental advance
AI Analysis

This work addresses limitations in pre-trained language models for high-quality generation tasks like commonsense reasoning, offering potential commercial value, but it is incremental as it builds on existing encoder-decoder architectures with novel modules.

The paper tackles the problem of improving generative commonsense reasoning in natural language generation by proposing KFCNet, which uses knowledge filtering and contrastive learning, resulting in significant performance gains on the CommonGen benchmark, such as a +6.6 point increase in BLEU-4.

Pre-trained language models have led to substantial gains over a broad range of natural language processing (NLP) tasks, but have been shown to have limitations for natural language generation tasks with high-quality requirements on the output, such as commonsense generation and ad keyword generation. In this work, we present a novel Knowledge Filtering and Contrastive learning Network (KFCNet) which references external knowledge and achieves better generation performance. Specifically, we propose a BERT-based filter model to remove low-quality candidates, and apply contrastive learning separately to each of the encoder and decoder, within a general encoder--decoder architecture. The encoder contrastive module helps to capture global target semantics during encoding, and the decoder contrastive module enhances the utility of retrieved prototypes while learning general features. Extensive experiments on the CommonGen benchmark show that our model outperforms the previous state of the art by a large margin: +6.6 points (42.5 vs. 35.9) for BLEU-4, +3.7 points (33.3 vs. 29.6) for SPICE, and +1.3 points (18.3 vs. 17.0) for CIDEr. We further verify the effectiveness of the proposed contrastive module on ad keyword generation, and show that our model has potential commercial value.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes