CL LGJun 25, 2023

Interpretable Neural Embeddings with Sparse Self-Representation

CMU

arXiv:2306.14135v10.5h-index: 14

Originality Incremental advance

AI Analysis

This work addresses the need for more stable and interpretable word embeddings in natural language processing, though it is incremental as it builds on existing sparse representation methods.

The paper tackles the problem of interpretability in word embeddings by proposing a method that combines data self-representation with a shallow neural network to learn expressive, interpretable embeddings, resulting in comparable or slightly better interpretability and competitive performance on downstream tasks.

Interpretability benefits the theoretical understanding of representations. Existing word embeddings are generally dense representations. Hence, the meaning of latent dimensions is difficult to interpret. This makes word embeddings like a black-box and prevents them from being human-readable and further manipulation. Many methods employ sparse representation to learn interpretable word embeddings for better interpretability. However, they also suffer from the unstable issue of grouped selection in $\ell1$ and online dictionary learning. Therefore, they tend to yield different results each time. To alleviate this challenge, we propose a novel method to associate data self-representation with a shallow neural network to learn expressive, interpretable word embeddings. In experiments, we report that the resulting word embeddings achieve comparable and even slightly better interpretability than baseline embeddings. Besides, we also evaluate that our approach performs competitively well on all downstream tasks and outperforms benchmark embeddings on a majority of them.

View on arXiv PDF

Similar