CLJun 29, 2021

Time-Aware Language Models as Temporal Knowledge Bases

arXiv:2106.15110v2703 citations
Originality Incremental advance
AI Analysis

This addresses the issue of outdated factual knowledge in language models for applications requiring up-to-date information, though it is incremental as it builds on existing temporal modeling approaches.

The authors tackled the problem of language models being limited by static training data, which fails to capture facts that change over time, by introducing a diagnostic dataset and a technique for modeling text with timestamps, resulting in improved memorization of seen facts and better calibration on unseen future facts.

Many facts come with an expiration date, from the name of the President to the basketball team Lebron James plays for. But language models (LMs) are trained on snapshots of data collected at a specific moment in time, and this can limit their utility, especially in the closed-book setting where the pretraining corpus must contain the facts the model should memorize. We introduce a diagnostic dataset aimed at probing LMs for factual knowledge that changes over time and highlight problems with LMs at either end of the spectrum -- those trained on specific slices of temporal data, as well as those trained on a wide range of temporal data. To mitigate these problems, we propose a simple technique for jointly modeling text with its timestamp. This improves memorization of seen facts from the training time period, as well as calibration on predictions about unseen facts from future time periods. We also show that models trained with temporal context can be efficiently "refreshed" as new data arrives, without the need for retraining from scratch.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes