EndToEndML: An Open-Source End-to-End Pipeline for Machine Learning Applications
This work addresses the problem for bioinformatics researchers by providing a user-friendly interface to analyze complex biological data, though it is incremental as it builds on existing AI libraries with a focus on accessibility.
The authors tackled the challenge of applying AI in life sciences by developing an open-source, web-based end-to-end pipeline that enables preprocessing, training, evaluating, and visualizing machine learning models without coding expertise, resulting in a tool that assists in tasks like drug discovery and medical diagnostics.
Artificial intelligence (AI) techniques are widely applied in the life sciences. However, applying innovative AI techniques to understand and deconvolute biological complexity is hindered by the learning curve for life science scientists to understand and use computing languages. An open-source, user-friendly interface for AI models, that does not require programming skills to analyze complex biological data will be extremely valuable to the bioinformatics community. With easy access to different sequencing technologies and increased interest in different 'omics' studies, the number of biological datasets being generated has increased and analyzing these high-throughput datasets is computationally demanding. The majority of AI libraries today require advanced programming skills as well as machine learning, data preprocessing, and visualization skills. In this research, we propose a web-based end-to-end pipeline that is capable of preprocessing, training, evaluating, and visualizing machine learning (ML) models without manual intervention or coding expertise. By integrating traditional machine learning and deep neural network models with visualizations, our library assists in recognizing, classifying, clustering, and predicting a wide range of multi-modal, multi-sensor datasets, including images, languages, and one-dimensional numerical data, for drug discovery, pathogen classification, and medical diagnostics.