Application of Markov Structure of Genomes to Outlier Identification and Read Classification
This work addresses bioinformatics problems for genome analysis, but it appears incremental as it applies an existing Markov structure method to specific virus data.
The authors tackled outlier identification in genome databases and read classification in metagenomics by modeling genomes as second-order Markov processes based on triplet base distributions, applying this to real coronavirus and adenovirus data.
In this paper we apply the structure of genomes as second-order Markov processes specified by the distributions of successive triplets of bases to two bioinformatics problems: identification of outliers in genome databases and read classification in metagenomics, using real coronavirus and adenovirus data.