CLOct 20, 2018

pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference

Mandar Joshi, Eunsol Choi, Omer Levy, Daniel S. Weld, Luke Zettlemoyer

arXiv:1810.08854v232.31123 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses cross-sentence inference tasks like QA and NLI by enhancing existing models with better word-pair representations, though it is incremental as it builds on prior methods.

The paper tackled the problem of reasoning about implied relationships between word pairs for cross-sentence inference by proposing compositional word-pair embeddings learned via pointwise mutual information, resulting in gains of 2.7% on SQuAD2.0 and 1.3% on MultiNLI, with improvements of 6-7% on adversarial datasets.

Reasoning about implied relationships (e.g., paraphrastic, common sense, encyclopedic) between pairs of words is crucial for many cross-sentence inference problems. This paper proposes new methods for learning and using embeddings of word pairs that implicitly represent background knowledge about such relationships. Our pairwise embeddings are computed as a compositional function on word representations, which is learned by maximizing the pointwise mutual information (PMI) with the contexts in which the two words co-occur. We add these representations to the cross-sentence attention layer of existing inference models (e.g. BiDAF for QA, ESIM for NLI), instead of extending or replacing existing word embeddings. Experiments show a gain of 2.7% on the recently released SQuAD2.0 and 1.3% on MultiNLI. Our representations also aid in better generalization with gains of around 6-7% on adversarial SQuAD datasets, and 8.8% on the adversarial entailment test set by Glockner et al. (2018).

View on arXiv PDF Code

Similar