CLFeb 1, 2021

Measuring and Improving Consistency in Pretrained Language Models

arXiv:2102.01017v2739 citations
AI Analysis

This addresses the reliability of PLMs for knowledge representation, which is crucial for applications in natural language processing, though it is incremental as it builds on existing evaluation and improvement techniques.

The authors tackled the problem of consistency in pretrained language models (PLMs) regarding factual knowledge, finding that all tested PLMs had poor consistency with high variance between relations, and they proposed a method that improved consistency.

Consistency of a model -- that is, the invariance of its behavior under meaning-preserving alternations in its input -- is a highly desirable property in natural language processing. In this paper we study the question: Are Pretrained Language Models (PLMs) consistent with respect to factual knowledge? To this end, we create ParaRel, a high-quality resource of cloze-style query English paraphrases. It contains a total of 328 paraphrases for 38 relations. Using ParaRel, we show that the consistency of all PLMs we experiment with is poor -- though with high variance between relations. Our analysis of the representational spaces of PLMs suggests that they have a poor structure and are currently not suitable for representing knowledge robustly. Finally, we propose a method for improving model consistency and experimentally demonstrate its effectiveness.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes