CLJun 6, 2016

A Decomposable Attention Model for Natural Language Inference

arXiv:1606.01933v21422 citations
Originality Highly original
AI Analysis

This work addresses efficient and accurate natural language inference for NLP applications, offering a novel method that is trivially parallelizable.

The authors tackled natural language inference by proposing a decomposable attention model that achieves state-of-the-art results on the SNLI dataset with significantly fewer parameters and minimal word-order information.

We propose a simple neural architecture for natural language inference. Our approach uses attention to decompose the problem into subproblems that can be solved separately, thus making it trivially parallelizable. On the Stanford Natural Language Inference (SNLI) dataset, we obtain state-of-the-art results with almost an order of magnitude fewer parameters than previous work and without relying on any word-order information. Adding intra-sentence attention that takes a minimum amount of order into account yields further improvements.

Code Implementations10 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes