An Analysis of a BERT Deep Learning Strategy on a Technology Assisted Review Task
This work addresses the challenge for physicians of screening large document volumes in clinical queries, but it is incremental as it applies existing BERT methods to a specific medical review task.
The paper tackled the problem of document screening in Evidence-Based Medicine by proposing a deep learning strategy using BERT embeddings for classification and similarity search. The result showed advanced retrieval performance in initial ranking on CLEF eHealth datasets, outperforming the BM25 plus RM3 model.
Document screening is a central task within Evidenced Based Medicine, which is a clinical discipline that supplements scientific proof to back medical decisions. Given the recent advances in DL (Deep Learning) methods applied to Information Retrieval tasks, I propose a DL document classification approach with BERT or PubMedBERT embeddings and a DL similarity search path using SBERT embeddings to reduce physicians' tasks of screening and classifying immense amounts of documents to answer clinical queries. I test and evaluate the retrieval effectiveness of my DL strategy on the 2017 and 2018 CLEF eHealth collections. I find that the proposed DL strategy works, I compare it to the recently successful BM25 plus RM3 model, and conclude that the suggested method accomplishes advanced retrieval performance in the initial ranking of the articles with the aforementioned datasets, for the CLEF eHealth Technologically Assisted Reviews in Empirical Medicine Task.