AICLLGFeb 8, 2023

Clinical BioBERT Hyperparameter Optimization using Genetic Algorithm

arXiv:2302.03822v22 citationsh-index: 30
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of structuring SDoH data in electronic health records for healthcare professionals, but it is incremental as it applies existing methods to a specific domain.

The paper tackled the problem of extracting Social Determinants of Health (SDoH) from unstructured clinical notes by optimizing a Clinical BioBERT model using a genetic algorithm for hyperparameter tuning, achieving improved accuracy with the AdamW optimizer compared to others like Adafactor and LAMB.

Clinical factors account only for a small portion, about 10-30%, of the controllable factors that affect an individual's health outcomes. The remaining factors include where a person was born and raised, where he/she pursued their education, what their work and family environment is like, etc. These factors are collectively referred to as Social Determinants of Health (SDoH). The majority of SDoH data is recorded in unstructured clinical notes by physicians and practitioners. Recording SDoH data in a structured manner (in an EHR) could greatly benefit from a dedicated ontology of SDoH terms. Our research focuses on extracting sentences from clinical notes, making use of such an SDoH ontology (called SOHO) to provide appropriate concepts. We utilize recent advancements in Deep Learning to optimize the hyperparameters of a Clinical BioBERT model for SDoH text. A genetic algorithm-based hyperparameter tuning regimen was implemented to identify optimal parameter settings. To implement a complete classifier, we pipelined Clinical BioBERT with two subsequent linear layers and two dropout layers. The output predicts whether a text fragment describes an SDoH issue of the patient. We compared the AdamW, Adafactor, and LAMB optimizers. In our experiments, AdamW outperformed the others in terms of accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes