Dynamic programming based RNA pseudoknot alignment
Author
Abstract
Pseudoknots are structural motifs of RNA molecules. This thesis considers the problem of RNA pseudoknot alignment. Most current approaches either discard pseudoknots for the sake of efficiency or rely on heuristics that yield only approximate solutions. This work focuses on dynamic-programming-based alignment methods and proposes two new approaches that solve the alignment problem exactly in the presence of pseudoknot structures. The first approach handles arbitrary pseudoknots but, because the problem is NP-hard, does not guarantee polynomial runtime for all instances. Nevertheless, a parameterized-complexity analysis shows that the algorithm is fixed-parameter tractable for a parameter that is small in practice. The second approach is a general scheme for aligning restricted classes of pseudoknots in polynomial time. It is motivated by existing RNA pseudoknot prediction algorithms. We show how to embed seven of those algorithms in a common scheme and present an analogous scheme for the alignment problem, which yields a corresponding alignment algorithm for each of the structure prediction algorithms. Each alignment algorithm handles the same class of pseudoknots as its prediction counterpart, and its time and space complexity exceed those of the prediction algorithm by only a linear factor. Both approaches have been implemented to evaluate their applicability in practice.
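As a small illustration of what "pseudoknot" means here, the sketch below checks whether a secondary structure, given as a list of base pairs, contains crossing pairs, which is the standard way to characterize a pseudoknot. It is not taken from the thesis: the function names `crossing_pairs` and `has_pseudoknot`, the 0-based position convention, and the example structures are assumptions made for illustration only.

```python
from itertools import combinations


def crossing_pairs(base_pairs):
    """Return every pair of base pairs (i, j), (k, l) with i < k < j < l.

    Hypothetical helper: base pairs are tuples of sequence positions.
    Two base pairs that interleave in this way cannot be drawn as nested
    arcs, which is what makes a structure pseudoknotted.
    """
    # Normalize each pair so that the smaller position comes first,
    # then sort so the pair with the smaller left end is visited first.
    normalized = sorted(tuple(sorted(pair)) for pair in base_pairs)
    crossings = []
    for (i, j), (k, l) in combinations(normalized, 2):
        if i < k < j < l:
            crossings.append(((i, j), (k, l)))
    return crossings


def has_pseudoknot(base_pairs):
    """True if at least two base pairs cross."""
    return bool(crossing_pairs(base_pairs))


# A plain hairpin stem: all base pairs are nested.
nested = [(0, 9), (1, 8), (2, 7)]
# Two interleaved stems, as in a simple H-type pseudoknot.
knotted = [(0, 10), (1, 9), (5, 15), (6, 14)]

print(has_pseudoknot(nested))
print(has_pseudoknot(knotted))
```

This quadratic check only detects crossings; it does not classify the pseudoknot or align anything, which are the harder problems the abstract addresses.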
Similar resources
Fast and accurate search for non-coding RNA pseudoknot structures in genomes
Motivation: Searching genomes for non-coding RNAs (ncRNAs) by their secondary structure has become an important goal for bioinformatics. For pseudoknot-free structures, ncRNA search can be effective based on the covariance model and CYK-type dynamic programming. However, the computational difficulty in aligning an RNA sequence to a pseudoknot has prohibited fast and accurate search of arbitrary ...
Lifting Prediction to Alignment of RNA Pseudoknots
Prediction and alignment of RNA pseudoknot structures are NP-hard. Nevertheless, several efficient prediction algorithms by dynamic programming have been proposed for restricted classes of pseudoknots. We present a general scheme that yields an efficient alignment algorithm for arbitrary such classes. Moreover, we show that such an alignment algorithm benefits from the class restriction in the ...
RNA Secondary Structure Alignment Based on Stem Representation
Comparison methods for RNA and protein molecules are important, basic tools in molecular biology. So far, most comparison methods, such as sequence alignment, are only applicable to the primary structures of biomolecules. However, the functions of biomolecules are closely related to their structures. The RNA secondary structure alignment problem is to align two given RNA structures to ...
Implementation Improvements to an RNA Pseudoknot Prediction Algorithm
By Yunzhou Wu (under the direction of Liming Cai). Abstract: The general problem of RNA pseudoknot prediction is computationally intractable. Most existing algorithms require the worst case CPU time O(N) and RAM space O(N) even for restricted pseudoknot categories. Such resource requirements make it infeasible to predict pseudoknots for RNA sequences of even a moderate length. This rese...
RNA Structural Alignment by Means of Lagrangian Relaxation
This thesis deals with an important topic in computational molecular biology: structurally correct alignments of RNA sequences. Whereas sequence information is normally sufficient for adequate alignments of DNA sequences, the structural aspects of RNA have to be taken into account when aligning RNA sequences, because the structure of RNA sequences tends to remain conserved throughout evolution. ...