CL · Apr 2, 2015

Learning to Understand Phrases by Embedding the Dictionary

arXiv:1504.00548v4 · 200 citations
Originality: Highly original
AI Analysis

This work addresses the problem of phrase understanding in NLP, offering a novel approach with practical applications, though it builds on existing distributional models.

The paper tackles the challenge of learning semantic representations for phrases and sentences by using dictionary definitions to bridge lexical and phrasal semantics. The resulting neural embedding models perform as well as or better than commercial systems on reverse-dictionary and crossword question-answering tasks.

Distributional models that learn rich semantic word representations are a success story of recent NLP research. However, developing models that learn useful representations of phrases and sentences has proved far harder. We propose using the definitions found in everyday dictionaries as a means of bridging this gap between lexical and phrasal semantics. Neural language embedding models can be effectively trained to map dictionary definitions (phrases) to (lexical) representations of the words defined by those definitions. We present two applications of these architectures: "reverse dictionaries" that return the name of a concept given a definition or description and general-knowledge crossword question answerers. On both tasks, neural language embedding models trained on definitions from a handful of freely-available lexical resources perform as well or better than existing commercial systems that rely on significant task-specific engineering. The results highlight the effectiveness of both neural embedding architectures and definition-based training for developing models that understand phrases and sentences.
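The core idea in the abstract, mapping a definition (a phrase) onto the embedding of the word it defines, can be illustrated with a small sketch. This is not the authors' implementation. It assumes a toy four-entry dictionary, random vectors standing in for pre-trained word embeddings, a bag-of-words encoder that averases learned input embeddings, and a squared-error loss. All names (`DICTIONARY`, `encode`, `train`, `reverse_lookup`) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 16

# Toy dictionary (hypothetical data): headword -> definition.
DICTIONARY = {
    "valve": "device that controls the flow of a fluid",
    "piano": "large musical instrument with a keyboard",
    "canoe": "narrow light boat moved with a paddle",
    "oven": "heated chamber used for baking food",
}

# Assumption: random vectors stand in for pre-trained target word embeddings.
target_emb = {word: rng.normal(size=DIM) for word in DICTIONARY}

# Learned input embeddings, one row per word that occurs in a definition.
def_vocab = sorted({tok for d in DICTIONARY.values() for tok in d.split()})
tok_id = {tok: i for i, tok in enumerate(def_vocab)}
E_in = rng.normal(scale=0.1, size=(len(def_vocab), DIM))


def token_ids(text):
    """Map a phrase to the ids of its known tokens (unknown tokens are dropped)."""
    return [tok_id[t] for t in text.lower().split() if t in tok_id]


def encode(text):
    """Bag-of-words encoder: the mean of the input embeddings of the phrase."""
    ids = token_ids(text)
    if not ids:
        return np.zeros(DIM)
    return E_in[ids].mean(axis=0)


def train(epochs=500, lr=0.5):
    """SGD on 0.5 * ||encode(definition) - target||^2, one definition at a time."""
    for _ in range(epochs):
        for word, definition in DICTIONARY.items():
            ids = token_ids(definition)
            err = E_in[ids].mean(axis=0) - target_emb[word]
            # Gradient w.r.t. each participating row is err / len(ids);
            # subtract.at accumulates correctly if a token id repeats.
            np.subtract.at(E_in, ids, lr * err / len(ids))


def reverse_lookup(description):
    """Rank headwords by cosine similarity to the encoded description."""
    q = encode(description)

    def cosine(v):
        return float(q @ v / (np.linalg.norm(q) * np.linalg.norm(v) + 1e-12))

    return sorted(target_emb, key=lambda w: cosine(target_emb[w]), reverse=True)


train()
print(reverse_lookup("narrow light boat moved with a paddle")[:2])
```

After training, the encoder's output for each definition lies close to the target embedding of its headword, so a reverse-dictionary query reduces to a nearest-neighbour search over the word embeddings. Swapping the averaging encoder for a recurrent one, or the squared error for a cosine or ranking objective, would change only `encode` and `train`.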

Code Implementations: 3 repos
