CLAINov 17, 2023

Use GPT-J Prompt Generation with RoBERTa for NER Models on Diagnosis Extraction of Periodontal Diagnosis from Electronic Dental Records

arXiv:2311.10810v16 citationsh-index: 30
Originality Synthesis-oriented
AI Analysis

This provides an efficient method for mining clinical notes in dentistry, though it is incremental as it applies existing prompt generation techniques to a specific domain.

This study tackled the problem of extracting periodontal diagnoses from electronic dental records using named entity recognition (NER) by employing GPT-J prompt generation with RoBERTa, achieving an F1 score of 0.72 in direct tests and 0.92-0.97 after training.

This study explored the usability of prompt generation on named entity recognition (NER) tasks and the performance in different settings of the prompt. The prompt generation by GPT-J models was utilized to directly test the gold standard as well as to generate the seed and further fed to the RoBERTa model with the spaCy package. In the direct test, a lower ratio of negative examples with higher numbers of examples in prompt achieved the best results with a F1 score of 0.72. The performance revealed consistency, 0.92-0.97 in the F1 score, in all settings after training with the RoBERTa model. The study highlighted the importance of seed quality rather than quantity in feeding NER models. This research reports on an efficient and accurate way to mine clinical notes for periodontal diagnoses, allowing researchers to easily and quickly build a NER model with the prompt generation approach.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes