LGGNMLDec 6, 2019

Deep Bayesian Recurrent Neural Networks for Somatic Variant Calling in Cancer

arXiv:1912.04174v1
Originality Incremental advance
AI Analysis

This work addresses the need for reliable confidence estimates in somatic variant calling for oncologists, enabling more robust clinical decisions, though it is incremental as it builds on existing neural network methods.

The paper tackled the problem of accurately identifying somatic variants in cancer DNA sequencing data by using deep Bayesian neural networks to provide probability estimates for variant calls, achieving similar performance to standard neural networks while offering confidence intervals.

The emerging field of precision oncology relies on the accurate pinpointing of alterations in the molecular profile of a tumor to provide personalized targeted treatments. Current methodologies in the field commonly include the application of next generation sequencing technologies to a tumor sample, followed by the identification of mutations in the DNA known as somatic variants. The differentiation of these variants from sequencing error poses a classic classification problem, which has traditionally been approached with Bayesian statistics, and more recently with supervised machine learning methods such as neural networks. Although these methods provide greater accuracy, classic neural networks lack the ability to indicate the confidence of a variant call. In this paper, we explore the performance of deep Bayesian neural networks on next generation sequencing data, and their ability to give probability estimates for somatic variant calls. In addition to demonstrating similar performance in comparison to standard neural networks, we show that the resultant output probabilities make these better suited to the disparate and highly-variable sequencing data-sets these models are likely to encounter in the real world. We aim to deliver algorithms to oncologists for which model certainty better reflects accuracy, for improved clinical application. By moving away from point estimates to reliable confidence intervals, we expect the resultant clinical and treatment decisions to be more robust and more informed by the underlying reality of the tumor molecular profile.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes