CLAICTDec 14, 2015

Sentence Entailment in Compositional Distributional Semantics

arXiv:1512.04419v250 citations
Originality Incremental advance
AI Analysis

This work addresses sentence entailment for natural language processing, but it is incremental as it builds on existing categorical compositional models with new distance measures.

The paper tackled the problem of measuring phrase and sentence entailment in compositional distributional semantics by proposing entropy-based distances for vectors and density matrices, showing that density matrices outperform vectors for word-level entailments and proving compositional extension to phrases and sentences, with preliminary experimental evidence on real and toy datasets.

Distributional semantic models provide vector representations for words by gathering co-occurrence frequencies from corpora of text. Compositional distributional models extend these from words to phrases and sentences. In categorical compositional distributional semantics, phrase and sentence representations are functions of their grammatical structure and representations of the words therein. In this setting, grammatical structures are formalised by morphisms of a compact closed category and meanings of words are formalised by objects of the same category. These can be instantiated in the form of vectors or density matrices. This paper concerns the applications of this model to phrase and sentence level entailment. We argue that entropy-based distances of vectors and density matrices provide a good candidate to measure word-level entailment, show the advantage of density matrices over vectors for word level entailments, and prove that these distances extend compositionally from words to phrases and sentences. We exemplify our theoretical constructions on real data and a toy entailment dataset and provide preliminary experimental evidence.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes