CLJul 2, 2020

Fact-based Text Editing

arXiv:2007.00916v131.21006 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the practical need for accurate text editing to reflect truth, though it is incremental as it builds on existing table-to-text datasets and neural methods.

The paper tackles the problem of revising documents to better align with facts in a knowledge base, proposing a new task called fact-based text editing and introducing FactEditor, a neural network that outperforms encoder-decoder models in fidelity and fluency while being faster.

We propose a novel text editing task, referred to as \textit{fact-based text editing}, in which the goal is to revise a given document to better describe the facts in a knowledge base (e.g., several triples). The task is important in practice because reflecting the truth is a common requirement in text editing. First, we propose a method for automatically generating a dataset for research on fact-based text editing, where each instance consists of a draft text, a revised text, and several facts represented in triples. We apply the method into two public table-to-text datasets, obtaining two new datasets consisting of 233k and 37k instances, respectively. Next, we propose a new neural network architecture for fact-based text editing, called \textsc{FactEditor}, which edits a draft text by referring to given facts using a buffer, a stream, and a memory. A straightforward approach to address the problem would be to employ an encoder-decoder model. Our experimental results on the two datasets show that \textsc{FactEditor} outperforms the encoder-decoder approach in terms of fidelity and fluency. The results also show that \textsc{FactEditor} conducts inference faster than the encoder-decoder approach.

View on arXiv PDF Code

Similar