CLMay 6, 2018

Breaking NLI Systems with Sentences that Require Simple Lexical Inferences

arXiv:1805.02266v11267 citations
Originality Synthesis-oriented
AI Analysis

This work addresses a critical limitation in NLI for AI and NLP researchers, though it is incremental as it focuses on exposing existing weaknesses rather than proposing a new solution.

The authors tackled the problem of natural language inference (NLI) systems failing to generalize to simple lexical inferences by creating a new test set that reveals deficiencies in state-of-the-art models, resulting in substantially worse performance compared to standard benchmarks like SNLI.

We create a new NLI test set that shows the deficiency of state-of-the-art models in inferences that require lexical and world knowledge. The new examples are simpler than the SNLI test set, containing sentences that differ by at most one word from sentences in the training set. Yet, the performance on the new test set is substantially worse across systems trained on SNLI, demonstrating that these systems are limited in their generalization ability, failing to capture many simple inferences.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes