CLLGNov 26, 2015

The Mechanism of Additive Composition

arXiv:1511.08407v428 citations
Originality Incremental advance
AI Analysis

This provides a theoretical foundation for a widely used method in natural language processing, which is incremental but addresses a known bottleneck in compositional semantics.

The authors tackled the problem of understanding the theoretical basis for additive composition in phrase meaning computation, proving an upper bound for its bias in terms of collocation strength and showing that stronger word collocations lead to more accurate approximations.

Additive composition (Foltz et al, 1998; Landauer and Dumais, 1997; Mitchell and Lapata, 2010) is a widely used method for computing meanings of phrases, which takes the average of vector representations of the constituent words. In this article, we prove an upper bound for the bias of additive composition, which is the first theoretical analysis on compositional frameworks from a machine learning point of view. The bound is written in terms of collocation strength; we prove that the more exclusively two successive words tend to occur together, the more accurate one can guarantee their additive composition as an approximation to the natural phrase vector. Our proof relies on properties of natural language data that are empirically verified, and can be theoretically derived from an assumption that the data is generated from a Hierarchical Pitman-Yor Process. The theory endorses additive composition as a reasonable operation for calculating meanings of phrases, and suggests ways to improve additive compositionality, including: transforming entries of distributional word vectors by a function that meets a specific condition, constructing a novel type of vector representations to make additive composition sensitive to word order, and utilizing singular value decomposition to train word vectors.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes