CLJan 21, 2023

Syntax-guided Neural Module Distillation to Probe Compositionality in Sentence Embeddings

CMU
arXiv:2301.08998v2269 citationsh-index: 6
Originality Incremental advance
AI Analysis

This work addresses the causal impact of syntax on compositionality in sentence embeddings, which is incremental as it builds on existing probing methods.

The study tackled the problem of probing compositionality in sentence embedding models by constructing a neural module net based on syntax parse to approximate transformer embeddings, finding that distillability correlates with model performance and syntax-guided composition is largely linear.

Past work probing compositionality in sentence embedding models faces issues determining the causal impact of implicit syntax representations. Given a sentence, we construct a neural module net based on its syntax parse and train it end-to-end to approximate the sentence's embedding generated by a transformer model. The distillability of a transformer to a Syntactic NeurAl Module Net (SynNaMoN) then captures whether syntax is a strong causal model of its compositional ability. Furthermore, we address questions about the geometry of semantic composition by specifying individual SynNaMoN modules' internal architecture & linearity. We find differences in the distillability of various sentence embedding models that broadly correlate with their performance, but observe that distillability doesn't considerably vary by model size. We also present preliminary evidence that much syntax-guided composition in sentence embedding models is linear, and that non-linearities may serve primarily to handle non-compositional phrases.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes