CLLGMay 28, 2019

Parallax: Visualizing and Understanding the Semantics of Embedding Spaces via Algebraic Formulae

arXiv:1905.12099v11090 citations
Originality Incremental advance
AI Analysis

This addresses the challenge for researchers and practitioners in machine learning and NLP who need to interpret embeddings, though it is incremental as it builds on existing projection techniques.

The paper tackles the problem of visualizing and understanding embedding spaces by proposing a method that uses algebraic formulae to define interpretable axes for projections, enabling more insightful analyses and comparisons than traditional methods.

Embeddings are a fundamental component of many modern machine learning and natural language processing models. Understanding them and visualizing them is essential for gathering insights about the information they capture and the behavior of the models. State of the art in analyzing embeddings consists in projecting them in two-dimensional planes without any interpretable semantics associated to the axes of the projection, which makes detailed analyses and comparison among multiple sets of embeddings challenging. In this work, we propose to use explicit axes defined as algebraic formulae over embeddings to project them into a lower dimensional, but semantically meaningful subspace, as a simple yet effective analysis and visualization methodology. This methodology assigns an interpretable semantics to the measures of variability and the axes of visualizations, allowing for both comparisons among different sets of embeddings and fine-grained inspection of the embedding spaces. We demonstrate the power of the proposed methodology through a series of case studies that make use of visualizations constructed around the underlying methodology and through a user study. The results show how the methodology is effective at providing more profound insights than classical projection methods and how it is widely applicable to many other use cases.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes