Implementation Improvements to an RNA Pseudoknot Prediction Algorithm
Authors
Yunzhou Wu (under the direction of Liming Cai)
Abstract
The general problem of RNA pseudoknot prediction is computationally intractable. Most existing algorithms require worst-case CPU time O(N) and RAM space O(N) even for restricted pseudoknot categories. Such resource requirements make it infeasible to predict pseudoknots for RNA sequences of even moderate length. This work demonstrates two implementation techniques, memory mapping and parallel computation, that reduce the resource usage of these algorithms by exploiting how the dynamic programming matrix is organized and computed. The techniques are applied to an automated, dynamic programming-based pseudoknot prediction algorithm we recently developed. Our experiments show memory savings of approximately 97% and a nearly 10-fold speedup (using 16 parallel processors) for the pseudoknot prediction tests conducted. Most existing RNA prediction algorithms are dynamic programming-based and evolved from the CYK algorithm for stem-loop prediction; thus our techniques are general and applicable to these algorithms as well.
Similar resources
Sparsification Enables Predicting Kissing Hairpin Pseudoknot Structures of Long RNAs in Practice
While computational RNA secondary structure prediction is an important tool in RNA research, it is still fundamentally limited to pseudoknot-free structures (or at best very simple pseudoknots) in practice. Here, we make the prediction of complex pseudoknots – including kissing hairpin structures – practically applicable by reducing the originally high space consumption. For this aim, we apply ...
PreRkTAG: Prediction of RNA Knotted Structures Using Tree Adjoining Grammars
Background: RNA molecules play many important regulatory, catalytic and structural ...
Dynamic programming based RNA pseudoknot alignment
Pseudoknots are certain structural motifs of RNA molecules. In this thesis we consider the problem of RNA pseudoknot alignment. Most current approaches either discard pseudoknots in order to be efficient or rely on heuristics generating only approximate solutions. This work focuses on dynamic programming based alignment methods and proposes two new approaches for an exact solution of the alignm...
Heuristic RNA pseudoknot prediction including intramolecular kissing hairpins.
Pseudoknots are an essential feature of RNA tertiary structures. Simple H-type pseudoknots have been studied extensively in terms of biological functions, computational prediction, and energy models. Intramolecular kissing hairpins are a more complex and biologically important type of pseudoknot in which two hairpin loops form base pairs. They are hard to predict using free energy minimization ...
DotKnot: pseudoknot prediction using the probability dot plot under a refined energy model
RNA pseudoknots are functional structure elements with key roles in viral and cellular processes. Prediction of a pseudoknotted minimum free energy structure is an NP-complete problem. Practical algorithms for RNA structure prediction including restricted classes of pseudoknots suffer from high runtime and poor accuracy for longer sequences. A heuristic approach is to search for promising pseud...