LGAICVGNQMAPMLSep 13, 2023

Reliability-based cleaning of noisy training labels with inductive conformal prediction in multi-modal biomedical data mining

arXiv:2309.07332v14 citationsh-index: 58
Originality Incremental advance
AI Analysis

This method addresses the challenge of labeling biomedical data for researchers and practitioners, offering a way to boost performance without extensive curated data, though it is incremental as it builds on existing semi-supervised and conformal prediction techniques.

The paper tackles the problem of noisy training labels in biomedical data by proposing a reliability-based cleaning method using inductive conformal prediction, which significantly improves classification performance across three tasks, with accuracy enhancements up to 74.6% and AUROC gains up to 23.8%.

Accurately labeling biomedical data presents a challenge. Traditional semi-supervised learning methods often under-utilize available unlabeled data. To address this, we propose a novel reliability-based training data cleaning method employing inductive conformal prediction (ICP). This method capitalizes on a small set of accurately labeled training data and leverages ICP-calculated reliability metrics to rectify mislabeled data and outliers within vast quantities of noisy training data. The efficacy of the method is validated across three classification tasks within distinct modalities: filtering drug-induced-liver-injury (DILI) literature with title and abstract, predicting ICU admission of COVID-19 patients through CT radiomics and electronic health records, and subtyping breast cancer using RNA-sequencing data. Varying levels of noise to the training labels were introduced through label permutation. Results show significant enhancements in classification performance: accuracy enhancement in 86 out of 96 DILI experiments (up to 11.4%), AUROC and AUPRC enhancements in all 48 COVID-19 experiments (up to 23.8% and 69.8%), and accuracy and macro-average F1 score improvements in 47 out of 48 RNA-sequencing experiments (up to 74.6% and 89.0%). Our method offers the potential to substantially boost classification performance in multi-modal biomedical machine learning tasks. Importantly, it accomplishes this without necessitating an excessive volume of meticulously curated training data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes