DBCLQMOct 16, 2023

Node-based Knowledge Graph Contrastive Learning for Medical Relationship Prediction

arXiv:2310.10138v12 citationsh-index: 12Has Code
Originality Highly original
AI Analysis

This work addresses a specific bottleneck in knowledge graph embedding for biomedical AI, offering an incremental improvement with a novel method for constructing contrastive pairs.

The paper tackles the challenge of constructing suitable contrastive pairs for knowledge graph embedding in biomedical applications, proposing NC-KGE, a node-based contrastive learning method that enhances knowledge extraction and speeds up training convergence, achieving competitive performance on public datasets and outperforming baselines in biomedical relationship prediction tasks, such as on PharmKG8k-28, DRKG17k-21, and BioKG72k-14.

The embedding of Biomedical Knowledge Graphs (BKGs) generates robust representations, valuable for a variety of artificial intelligence applications, including predicting drug combinations and reasoning disease-drug relationships. Meanwhile, contrastive learning (CL) is widely employed to enhance the distinctiveness of these representations. However, constructing suitable contrastive pairs for CL, especially within Knowledge Graphs (KGs), has been challenging. In this paper, we proposed a novel node-based contrastive learning method for knowledge graph embedding, NC-KGE. NC-KGE enhances knowledge extraction in embeddings and speeds up training convergence by constructing appropriate contrastive node pairs on KGs. This scheme can be easily integrated with other knowledge graph embedding (KGE) methods. For downstream task such as biochemical relationship prediction, we have incorporated a relation-aware attention mechanism into NC-KGE, focusing on the semantic relationships and node interactions. Extensive experiments show that NC-KGE performs competitively with state-of-the-art models on public datasets like FB15k-237 and WN18RR. Particularly in biomedical relationship prediction tasks, NC-KGE outperforms all baselines on datasets such as PharmKG8k-28, DRKG17k-21, and BioKG72k-14, especially in predicting drug combination relationships. We release our code at https://github.com/zhi520/NC-KGE.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes