CL IRDec 20, 2024

ADEQA: A Question Answer based approach for joint ADE-Suspect Extraction using Sequence-To-Sequence Transformers

arXiv:2412.15510v134.1222 citationsh-index: 8BioNLP

Originality Incremental advance

AI Analysis

This work addresses the challenge of early ADE identification from clinical and social media data, which is critical for drug safety, but it is incremental as it builds on existing QA and transformer methods.

The paper tackled the problem of extracting Adverse Drug Events (ADEs) and suspect drugs from unstructured text by introducing ADEQA, a question-answer approach using sequence-to-sequence transformers, achieving a state-of-the-art F1 score of 94% on relationship extraction.

Early identification of Adverse Drug Events (ADE) is critical for taking prompt actions while introducing new drugs into the market. These ADEs information are available through various unstructured data sources like clinical study reports, patient health records, social media posts, etc. Extracting ADEs and the related suspect drugs using machine learning is a challenging task due to the complex linguistic relations between drug ADE pairs in textual data and unavailability of large corpus of labelled datasets. This paper introduces ADEQA, a question-answer(QA) based approach using quasi supervised labelled data and sequence-to-sequence transformers to extract ADEs, drug suspects and the relationships between them. Unlike traditional QA models, natural language generation (NLG) based models don't require extensive token level labelling and thereby reduces the adoption barrier significantly. On a public ADE corpus, we were able to achieve state-of-the-art results with an F1 score of 94% on establishing the relationships between ADEs and the respective suspects.

View on arXiv PDF

Similar