CLAIOct 17, 2022

Using Bottleneck Adapters to Identify Cancer in Clinical Notes under Low-Resource Constraints

arXiv:2210.09440v2223 citationsh-index: 52Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of biomedical NLP for cancer detection in clinical notes, particularly in low-resource settings, but it is incremental as it applies existing efficient fine-tuning methods to a specific domain.

The paper tackled the problem of identifying cancer in clinical notes under low-resource constraints by evaluating machine learning techniques, finding that fine-tuning a frozen BERT model with bottleneck adapters outperformed other strategies, including full fine-tuning of BioBERT.

Processing information locked within clinical health records is a challenging task that remains an active area of research in biomedical NLP. In this work, we evaluate a broad set of machine learning techniques ranging from simple RNNs to specialised transformers such as BioBERT on a dataset containing clinical notes along with a set of annotations indicating whether a sample is cancer-related or not. Furthermore, we specifically employ efficient fine-tuning methods from NLP, namely, bottleneck adapters and prompt tuning, to adapt the models to our specialised task. Our evaluations suggest that fine-tuning a frozen BERT model pre-trained on natural language and with bottleneck adapters outperforms all other strategies, including full fine-tuning of the specialised BioBERT model. Based on our findings, we suggest that using bottleneck adapters in low-resource situations with limited access to labelled data or processing capacity could be a viable strategy in biomedical text mining. The code used in the experiments are going to be made available at https://github.com/omidrohanian/bottleneck-adapters.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes