CLLGSep 4, 2018

A Novel Neural Sequence Model with Multiple Attentions for Word Sense Disambiguation

arXiv:1809.01074v11 citations
Originality Incremental advance
AI Analysis

This work addresses word sense disambiguation for computational linguistics, offering an incremental improvement by combining existing attention mechanisms.

The paper tackles the gap between supervised models and knowledge-based solutions in word sense disambiguation by proposing a neural sequence model with multiple attentions on linguistic features, achieving state-of-the-art results.

Word sense disambiguation (WSD) is a well researched problem in computational linguistics. Different research works have approached this problem in different ways. Some state of the art results that have been achieved for this problem are by supervised models in terms of accuracy, but they often fall behind flexible knowledge-based solutions which use engineered features as well as human annotators to disambiguate every target word. This work focuses on bridging this gap using neural sequence models incorporating the well-known attention mechanism. The main gist of our work is to combine multiple attentions on different linguistic features through weights and to provide a unified framework for doing this. This weighted attention allows the model to easily disambiguate the sense of an ambiguous word by attending over a suitable portion of a sentence. Our extensive experiments show that multiple attention enables a more versatile encoder-decoder model leading to state of the art results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes