CLLGNov 11, 2022

The Architectural Bottleneck Principle

CambridgeETH Zurich
arXiv:2211.06420v164 citationsh-index: 27
Originality Incremental advance
AI Analysis

This addresses a fundamental issue in interpretability for researchers, though it is incremental as it builds on prior probing work.

The paper tackles the problem of measuring how much information a neural network component can extract from its inputs, proposing the architectural bottleneck principle that probes should resemble the component. They find that in BERT, ALBERT, and RoBERTa, syntax trees are mostly extractable by an attentional probe, indicating access to syntactic information.

In this paper, we seek to measure how much information a component in a neural network could extract from the representations fed into it. Our work stands in contrast to prior probing work, most of which investigates how much information a model's representations contain. This shift in perspective leads us to propose a new principle for probing, the architectural bottleneck principle: In order to estimate how much information a given component could extract, a probe should look exactly like the component. Relying on this principle, we estimate how much syntactic information is available to transformers through our attentional probe, a probe that exactly resembles a transformer's self-attention head. Experimentally, we find that, in three models (BERT, ALBERT, and RoBERTa), a sentence's syntax tree is mostly extractable by our probe, suggesting these models have access to syntactic information while composing their contextual representations. Whether this information is actually used by these models, however, remains an open question.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes