BMLGJan 8, 2013

An Efficient Algorithm for Upper Bound on the Partition Function of Nucleic Acids

arXiv:1301.1590v13 citations
Originality Incremental advance
AI Analysis

This work addresses a computational bottleneck for ensemble-based RNA structure prediction, which is more reliable than minimum free energy methods, but is incremental as it builds on existing sparse folding algorithms.

The paper tackles the high computational complexity of partition function calculations for RNA and RNA-RNA interactions by developing a fast algorithm to compute an upper bound, achieving space complexity matching sparse folding and time complexity of O(MFE(n)ℓ) in practice.

It has been shown that minimum free energy structure for RNAs and RNA-RNA interaction is often incorrect due to inaccuracies in the energy parameters and inherent limitations of the energy model. In contrast, ensemble based quantities such as melting temperature and equilibrium concentrations can be more reliably predicted. Even structure prediction by sampling from the ensemble and clustering those structures by Sfold [7] has proven to be more reliable than minimum free energy structure prediction. The main obstacle for ensemble based approaches is the computational complexity of the partition function and base pairing probabilities. For instance, the space complexity of the partition function for RNA-RNA interaction is $O(n^4)$ and the time complexity is $O(n^6)$ which are prohibitively large [4,12]. Our goal in this paper is to give a fast algorithm, based on sparse folding, to calculate an upper bound on the partition function. Our work is based on the recent algorithm of Hazan and Jaakkola [10]. The space complexity of our algorithm is the same as that of sparse folding algorithms, and the time complexity of our algorithm is $O(MFE(n)\ell)$ for single RNA and $O(MFE(m, n)\ell)$ for RNA-RNA interaction in practice, in which $MFE$ is the running time of sparse folding and $\ell \leq n$ ($\ell \leq n + m$) is a sequence dependent parameter.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes