CLAIDLIT May 22, 2023

Evaluating Prompt-based Question Answering for Object Prediction in the Open Research Knowledge Graph

arXiv:2305.12900v2, 2 citations
Originality: Synthesis-oriented
AI Analysis

The paper addresses the domain knowledge capture gap for scholarly data in low-resource settings and offers an incentive for more investment in this area, though it is incremental in that it adapts existing methods to a new domain.

This work tackles the problem of object prediction in scholarly knowledge graphs by applying prompt-based training of transformer models, achieving performance boosts of up to 40% in a relaxed evaluation setting.

There have been many recent investigations into prompt-based training of transformer language models for new text genres in low-resource settings. The prompt-based training approach has been found to be effective in generalizing pre-trained or fine-tuned models for transfer to resource-scarce settings. This work, for the first time, reports results on adopting prompt-based training of transformers for scholarly knowledge graph object prediction. The work is unique in two main aspects. 1) It deviates from other works that propose entity and relation extraction pipelines for predicting objects of a scholarly knowledge graph. 2) While other works have tested the method on text genres relatively close to the general knowledge domain, we test the method on a significantly different domain, i.e. scholarly knowledge, in turn testing the linguistic, probabilistic, and factual generalizability of these large-scale transformer models. We find that (i) as expected, transformer models tested out-of-the-box underperform on a new domain of data, (ii) prompt-based training of the models achieves performance boosts of up to 40% in a relaxed evaluation setting, and (iii) testing the models on a starkly different domain, even with a clever training objective in a low-resource setting, makes evident the domain knowledge capture gap, offering an empirically verified incentive for investing more attention and resources in the scholarly domain in the context of transformer models.
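
To make the setup concrete, here is a minimal Python sketch of how object prediction could be framed as question answering and scored under a strict versus a relaxed criterion. Everything in it is an illustrative assumption and is not taken from the paper: the question template in `build_question`, the normalization rules, and the substring-containment definition used for `relaxed_match`. The abstract does not specify how the relaxed evaluation setting is defined.

```python
import re


def build_question(subject: str, predicate: str) -> str:
    """Turn a (subject, predicate) pair into a question prompt.

    The template is hypothetical; the paper's actual prompts may differ.
    """
    return f"What is the {predicate} of {subject}?"


def normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    text = re.sub(r"[^\w\s]", " ", text.lower())
    return " ".join(text.split())


def exact_match(prediction: str, gold: str) -> bool:
    """Strict criterion: normalized strings must be identical."""
    return normalize(prediction) == normalize(gold)


def relaxed_match(prediction: str, gold: str) -> bool:
    """Assumed relaxed criterion: one normalized string contains the other."""
    p, g = normalize(prediction), normalize(gold)
    return bool(p) and bool(g) and (p in g or g in p)
```

Under these assumptions, a predicted object such as "the task is question answering" would count as correct against the gold object "Question Answering" in the relaxed setting but not in the strict one, which shows how a relaxed setting can report larger gains than exact matching.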
