CLAILGNESep 22, 2015

Reasoning about Entailment with Neural Attention

arXiv:1509.06664v4778 citations
Originality Highly original
AI Analysis

This work addresses the challenge of improving entailment recognition for natural language processing applications, representing a significant advance over prior approaches.

The paper tackles the problem of textual entailment recognition by proposing a neural model with LSTM units and a word-by-word attention mechanism, achieving state-of-the-art accuracy on a large dataset and outperforming previous neural and feature-based methods.

While most approaches to automatically recognizing entailment relations have used classifiers employing hand engineered features derived from complex natural language processing pipelines, in practice their performance has been only slightly better than bag-of-word pair classifiers using only lexical similarity. The only attempt so far to build an end-to-end differentiable neural network for entailment failed to outperform such a simple similarity classifier. In this paper, we propose a neural model that reads two sentences to determine entailment using long short-term memory units. We extend this model with a word-by-word neural attention mechanism that encourages reasoning over entailments of pairs of words and phrases. Furthermore, we present a qualitative analysis of attention weights produced by this model, demonstrating such reasoning capabilities. On a large entailment dataset this model outperforms the previous best neural model and a classifier with engineered features by a substantial margin. It is the first generic end-to-end differentiable system that achieves state-of-the-art accuracy on a textual entailment dataset.

Code Implementations7 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes