AIGNAug 31, 2016

Binary Particle Swarm Optimization versus Hybrid Genetic Algorithm for Inferring Well Supported Phylogenetic Trees

arXiv:1608.08749v15 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of building reliable large-scale phylogenetic trees for plant species by reducing computational complexity in gene selection, but it is incremental as it compares BPSO with existing hybrid genetic algorithms.

The paper tackled the problem of inferring well-supported phylogenetic trees from chloroplast genomes by identifying the largest subset of core genes that minimizes problematic genes, using a distributed Binary Particle Swarm Optimization (BPSO) approach. Results showed encouraging outcomes when applied to various plant families, though specific numerical gains were not detailed.

The amount of completely sequenced chloroplast genomes increases rapidly every day, leading to the possibility to build large-scale phylogenetic trees of plant species. Considering a subset of close plant species defined according to their chloroplasts, the phylogenetic tree that can be inferred by their core genes is not necessarily well supported, due to the possible occurrence of problematic genes (i.e., homoplasy, incomplete lineage sorting, horizontal gene transfers, etc.) which may blur the phylogenetic signal. However, a trustworthy phylogenetic tree can still be obtained provided such a number of blurring genes is reduced. The problem is thus to determine the largest subset of core genes that produces the best-supported tree. To discard problematic genes and due to the overwhelming number of possible combinations, this article focuses on how to extract the largest subset of sequences in order to obtain the most supported species tree. Due to computational complexity, a distributed Binary Particle Swarm Optimization (BPSO) is proposed in sequential and distributed fashions. Obtained results from both versions of the BPSO are compared with those computed using an hybrid approach embedding both genetic algorithms and statistical tests. The proposal has been applied to different cases of plant families, leading to encouraging results for these families.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes