CL LGNov 14, 2019

Sparse associative memory based on contextual code learning for disambiguating word senses

Max Raphael Sobroza, Tales Marra, Deok-Hee Kim-Dufor, Claude Berrou

arXiv:1911.06415v13 citations

Originality Incremental advance

AI Analysis

This work addresses memory and interpretability issues in word sense disambiguation for natural language processing applications, but it is incremental as it builds on existing pretrained models.

The paper tackles the problem of dense, memory-intensive word representations from pretrained language models in word sense disambiguation by proposing a biologically inspired technique to compress these representations, resulting in improved interpretability, reduced memory footprint, and enhanced performance.

In recent literature, contextual pretrained Language Models (LMs) demonstrated their potential in generalizing the knowledge to several Natural Language Processing (NLP) tasks including supervised Word Sense Disambiguation (WSD), a challenging problem in the field of Natural Language Understanding (NLU). However, word representations from these models are still very dense, costly in terms of memory footprint, as well as minimally interpretable. In order to address such issues, we propose a new supervised biologically inspired technique for transferring large pre-trained language model representations into a compressed representation, for the case of WSD. Our produced representation contributes to increase the general interpretability of the framework and to decrease memory footprint, while enhancing performance.

View on arXiv PDF

Similar