CLApr 7, 2020

Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature and PRESupposition

arXiv:2004.03066v21024 citations
AI Analysis

This addresses the understudied ability of NLI models to handle pragmatic inferences for natural language understanding, with incremental contributions in dataset creation and model evaluation.

The paper tackled the problem of evaluating whether natural language inference models learn pragmatic inferences, by creating the IMPPRES dataset with over 25k sentence pairs and testing models like BERT, finding that BERT reliably treats certain implicatures and presuppositions as entailments, while other models show weaker performance.

Natural language inference (NLI) is an increasingly important task for natural language understanding, which requires one to infer whether a sentence entails another. However, the ability of NLI models to make pragmatic inferences remains understudied. We create an IMPlicature and PRESupposition diagnostic dataset (IMPPRES), consisting of >25k semiautomatically generated sentence pairs illustrating well-studied pragmatic inference types. We use IMPPRES to evaluate whether BERT, InferSent, and BOW NLI models trained on MultiNLI (Williams et al., 2018) learn to make pragmatic inferences. Although MultiNLI appears to contain very few pairs illustrating these inference types, we find that BERT learns to draw pragmatic inferences. It reliably treats scalar implicatures triggered by "some" as entailments. For some presupposition triggers like "only", BERT reliably recognizes the presupposition as an entailment, even when the trigger is embedded under an entailment canceling operator like negation. BOW and InferSent show weaker evidence of pragmatic reasoning. We conclude that NLI training encourages models to learn some, but not all, pragmatic inferences.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes