CLApr 13, 2021

DirectProbe: Studying Representations without Classifiers

arXiv:2104.05904v1738 citations
Originality Incremental advance
AI Analysis

This work addresses a methodological issue for NLP researchers by providing a more reliable way to probe embeddings, though it is incremental as it builds on existing probing concepts.

The paper tackled the problem of unreliable probing methods for studying linguistic structures in contextualized embeddings by introducing DirectProbe, a heuristic that directly analyzes representation geometry without training classifiers, showing it can reveal label representations and predict classifier performance across several linguistic tasks.

Understanding how linguistic structures are encoded in contextualized embedding could help explain their impressive performance across NLP@. Existing approaches for probing them usually call for training classifiers and use the accuracy, mutual information, or complexity as a proxy for the representation's goodness. In this work, we argue that doing so can be unreliable because different representations may need different classifiers. We develop a heuristic, DirectProbe, that directly studies the geometry of a representation by building upon the notion of a version space for a task. Experiments with several linguistic tasks and contextualized embeddings show that, even without training classifiers, DirectProbe can shine light into how an embedding space represents labels, and also anticipate classifier performance for the representation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes