CLAINov 13, 2021

GraphPrompt: Graph-Based Prompt Templates for Biomedical Synonym Prediction

arXiv:2112.03002v26 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the tedious curation of biomedical datasets by automating synonym mapping, though it is incremental as it builds on existing prompt-based and graph-based methods for a specific domain.

The paper tackles the problem of biomedical synonym prediction, where terms must be mapped to ontologies without context clues, by proposing GraphPrompt, a prompt-based learning method that uses graph-based templates, achieving 37.2% and 28.5% improvements in zero-shot and few-shot settings respectively.

In the expansion of biomedical dataset, the same category may be labeled with different terms, thus being tedious and onerous to curate these terms. Therefore, automatically mapping synonymous terms onto the ontologies is desirable, which we name as biomedical synonym prediction task. Unlike biomedical concept normalization (BCN), no clues from context can be used to enhance synonym prediction, making it essential to extract graph features from ontology. We introduce an expert-curated dataset OBO-syn encompassing 70 different types of concepts and 2 million curated concept-term pairs for evaluating synonym prediction methods. We find BCN methods perform weakly on this task for not making full use of graph information. Therefore, we propose GraphPrompt, a prompt-based learning approach that creates prompt templates according to the graphs. GraphPrompt obtained 37.2\% and 28.5\% improvement on zero-shot and few-shot settings respectively, indicating the effectiveness of these graph-based prompt templates. We envision that our method GraphPrompt and OBO-syn dataset can be broadly applied to graph-based NLP tasks, and serve as the basis for analyzing diverse and accumulating biomedical data. All the data and codes are avalible at: https://github.com/HanwenXuTHU/GraphPrompt

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes