CLSep 10, 2021

Mixture-of-Partitions: Infusing Large Biomedical Knowledge Graphs into BERT

arXiv:2109.04810v1664 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of integrating extensive factual knowledge into pre-trained models for biomedical applications, representing an incremental advancement in knowledge infusion methods.

The paper tackled the problem of infusing large biomedical knowledge graphs into BERT models to improve performance on knowledge-intensive tasks, achieving new state-of-the-art results on five out of six evaluated datasets.

Infusing factual knowledge into pre-trained models is fundamental for many knowledge-intensive tasks. In this paper, we proposed Mixture-of-Partitions (MoP), an infusion approach that can handle a very large knowledge graph (KG) by partitioning it into smaller sub-graphs and infusing their specific knowledge into various BERT models using lightweight adapters. To leverage the overall factual knowledge for a target task, these sub-graph adapters are further fine-tuned along with the underlying BERT through a mixture layer. We evaluate our MoP with three biomedical BERTs (SciBERT, BioBERT, PubmedBERT) on six downstream tasks (inc. NLI, QA, Classification), and the results show that our MoP consistently enhances the underlying BERTs in task performance, and achieves new SOTA performances on five evaluated datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes