CLIRJan 24, 2022

Artefact Retrieval: Overview of NLP Models with Knowledge Base Access

arXiv:2201.09651v16 citations
Originality Synthesis-oriented
AI Analysis

This work provides a foundational overview for researchers in NLP, but it is incremental as it organizes existing knowledge rather than introducing new methods.

The paper tackles the problem of systematically categorizing how NLP models access and incorporate knowledge bases, resulting in a typology of artefacts, retrieval mechanisms, and fusion methods that reveals untried design combinations and aids in transferring architectures across NLP tasks.

Many NLP models gain performance by having access to a knowledge base. A lot of research has been devoted to devising and improving the way the knowledge base is accessed and incorporated into the model, resulting in a number of mechanisms and pipelines. Despite the diversity of proposed mechanisms, there are patterns in the designs of such systems. In this paper, we systematically describe the typology of artefacts (items retrieved from a knowledge base), retrieval mechanisms and the way these artefacts are fused into the model. This further allows us to uncover combinations of design decisions that had not yet been tried. Most of the focus is given to language models, though we also show how question answering, fact-checking and knowledgable dialogue models fit into this system as well. Having an abstract model which can describe the architecture of specific models also helps with transferring these architectures between multiple NLP tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes