LG | Jan 14, 2025

Modeling Quantum Machine Learning for Genomic Data Analysis

arXiv:2501.08193v1 | 8 citations | h-index: 26 | Has Code
Originality: Synthesis-oriented
AI Analysis

This work applies quantum machine learning to genomic data classification, but its contribution is incremental: it evaluates existing methods on a new dataset.

The study investigated quantum machine learning models for binary classification of genome sequence data, finding that Pegasos-QSVC achieved high recall and QNNs had the highest training accuracy, but performance varied significantly with feature mapping techniques.

Quantum Machine Learning (QML) continues to evolve, unlocking new opportunities for diverse applications. In this study, we investigate and evaluate the applicability of QML models for binary classification of genome sequence data by employing various feature mapping techniques. We present an open-source, independent Qiskit-based implementation to conduct experiments on a benchmark genomic dataset. Our simulations reveal that the interplay between feature mapping techniques and QML algorithms significantly influences performance. Notably, the Pegasos Quantum Support Vector Classifier (Pegasos-QSVC) exhibits high sensitivity, particularly excelling in recall metrics, while Quantum Neural Networks (QNN) achieve the highest training accuracy across all feature maps. However, the pronounced variability in classifier performance, dependent on feature mapping, highlights the risk of overfitting to localized output distributions in certain scenarios. This work underscores the transformative potential of QML for genomic data classification while emphasizing the need for continued advancements to enhance the robustness and accuracy of these methodologies.
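The abstract contrasts recall (sensitivity) with accuracy, and the two can diverge sharply for the same classifier. The following is a minimal sketch of how both metrics are computed for binary labels. The helper names and the toy labels are hypothetical and chosen only for illustration; they are not taken from the paper's code or results.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for binary labels."""
    tp = fp = fn = tn = 0
    for t, p in zip(y_true, y_pred):
        if p == positive and t == positive:
            tp += 1  # predicted positive, actually positive
        elif p == positive:
            fp += 1  # predicted positive, actually negative
        elif t == positive:
            fn += 1  # predicted negative, actually positive
        else:
            tn += 1  # predicted negative, actually negative
    return tp, fp, fn, tn


def recall(y_true, y_pred, positive=1):
    """Recall (sensitivity): fraction of true positives that were found."""
    tp, _, fn, _ = confusion_counts(y_true, y_pred, positive)
    return tp / (tp + fn) if (tp + fn) else 0.0


def accuracy(y_true, y_pred):
    """Fraction of all predictions that match the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true) if y_true else 0.0


# Toy labels, made up for illustration (not results from the paper).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 1, 1, 1, 0, 0, 1]

print("recall:  ", recall(y_true, y_pred))
print("accuracy:", accuracy(y_true, y_pred))
```

In this toy case the classifier finds every positive sample but mislabels two negatives as positive, so recall is perfect while accuracy is lower. A model that leans toward predicting the positive class can therefore score well on recall alone, which is one reason to read a high recall figure alongside other metrics.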

