CLApr 12, 2022

What do Toothbrushes do in the Kitchen? How Transformers Think our World is Structured

arXiv:2204.05673v1629 citationsh-index: 24
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of evaluating knowledge extraction in language models for researchers in NLP, but it is incremental as it builds on existing bias research.

The paper investigates how well transformer-based language models extract knowledge about object relations compared to static models, finding that static models perform nearly as well or better in some cases, and that classifier-based approaches outperform similarity measures.

Transformer-based models are now predominant in NLP. They outperform approaches based on static models in many respects. This success has in turn prompted research that reveals a number of biases in the language models generated by transformers. In this paper we utilize this research on biases to investigate to what extent transformer-based language models allow for extracting knowledge about object relations (X occurs in Y; X consists of Z; action A involves using X). To this end, we compare contextualized models with their static counterparts. We make this comparison dependent on the application of a number of similarity measures and classifiers. Our results are threefold: Firstly, we show that the models combined with the different similarity measures differ greatly in terms of the amount of knowledge they allow for extracting. Secondly, our results suggest that similarity measures perform much worse than classifier-based approaches. Thirdly, we show that, surprisingly, static models perform almost as well as contextualized models -- in some cases even better.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes