CLApr 22, 2021

Low Anisotropy Sense Retrofitting (LASeR) : Towards Isotropic and Sense Enriched Representations

arXiv:2104.10833v131.8729 citations

Originality Synthesis-oriented

AI Analysis

This addresses the representation degeneration problem in NLP for researchers and practitioners, but it is incremental as it builds on existing models with a post-processing step.

The paper tackled the problem of anisotropic and poorly explained sense disambiguation in contextual word representations from pretrained language models, proposing LASeR as a post-processing method to make representations isotropic and sense-enriched, with results showing improved geometry and semantic meaningfulness.

Contextual word representation models have shown massive improvements on a multitude of NLP tasks, yet their word sense disambiguation capabilities remain poorly explained. To address this gap, we assess whether contextual word representations extracted from deep pretrained language models create distinguishable representations for different senses of a given word. We analyze the representation geometry and find that most layers of deep pretrained language models create highly anisotropic representations, pointing towards the existence of representation degeneration problem in contextual word representations. After accounting for anisotropy, our study further reveals that there is variability in sense learning capabilities across different language models. Finally, we propose LASeR, a 'Low Anisotropy Sense Retrofitting' approach that renders off-the-shelf representations isotropic and semantically more meaningful, resolving the representation degeneration problem as a post-processing step, and conducting sense-enrichment of contextualized representations extracted from deep neural language models.

View on arXiv PDF

Similar