CLLGJul 23, 2019

MacNet: Transferring Knowledge from Machine Comprehension to Sequence-to-Sequence Models

arXiv:1908.01816v17 citations
AI Analysis

This work addresses the challenge of enhancing text understanding in sequence-to-sequence tasks for natural language processing applications, representing an incremental advancement by applying existing machine comprehension insights to new models.

The paper tackles the problem of improving sequence-to-sequence models by transferring knowledge from machine comprehension, resulting in significant performance gains in neural machine translation and achieving state-of-the-art results in abstractive text summarization on the Gigaword dataset.

Machine Comprehension (MC) is one of the core problems in natural language processing, requiring both understanding of the natural language and knowledge about the world. Rapid progress has been made since the release of several benchmark datasets, and recently the state-of-the-art models even surpass human performance on the well-known SQuAD evaluation. In this paper, we transfer knowledge learned from machine comprehension to the sequence-to-sequence tasks to deepen the understanding of the text. We propose MacNet: a novel encoder-decoder supplementary architecture to the widely used attention-based sequence-to-sequence models. Experiments on neural machine translation (NMT) and abstractive text summarization show that our proposed framework can significantly improve the performance of the baseline models, and our method for the abstractive text summarization achieves the state-of-the-art results on the Gigaword dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes