27 Jan 2006. A unifying framework for seed sensitivity and its application to subset seeds (Extended abstract)
Authors
Abstract
We propose a general approach to computing seed sensitivity that can be applied to different definitions of seeds. It treats separately three components of the seed sensitivity problem – a set of target alignments, an associated probability distribution, and a seed model – each specified by a distinct finite automaton. The approach is then applied to a new concept of subset seeds, for which we propose an efficient automaton construction. Experimental results confirm that sensitive subset seeds can be designed efficiently using our approach, and that in similarity search they produce better results than ordinary spaced seeds.
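To make the three components concrete, here is a minimal sketch for the simplest possible instance. It is not the automaton construction of the paper: it assumes the target alignments are all binary strings of a fixed length, the probability distribution is Bernoulli (each column is a match with probability `p`, independently), and the seed model is an ordinary spaced seed written with `#` (must match) and `-` (don't care). The function name `seed_sensitivity` and its parameters are hypothetical. The computation is a plain dynamic program over the last `span - 1` columns, tracking the probability mass of alignments that the seed has not yet hit.

```python
from itertools import product


def seed_sensitivity(seed: str, length: int, p: float) -> float:
    """Probability that a spaced seed hits a random alignment.

    seed   -- string over '#' (position must match) and '-' (don't care)
    length -- number of alignment columns (assumed fixed)
    p      -- probability that a column is a match (Bernoulli assumption)
    """
    span = len(seed)
    if length < span:
        return 0.0
    required = [i for i, c in enumerate(seed) if c == "#"]
    k = span - 1

    # State: the last k columns read so far (1 = match, 0 = mismatch).
    # Value: probability of reaching that state without any seed hit.
    states = {}
    for bits in product((0, 1), repeat=k):
        prob = 1.0
        for b in bits:
            prob *= p if b else 1.0 - p
        states[bits] = prob

    # Read the remaining columns one at a time.
    for _ in range(length - k):
        nxt = {}
        for bits, prob in states.items():
            for b, pb in ((1, p), (0, 1.0 - p)):
                window = bits + (b,)
                if all(window[i] for i in required):
                    continue  # the seed hits here: this mass leaves the no-hit set
                key = window[1:]
                nxt[key] = nxt.get(key, 0.0) + prob * pb
        states = nxt

    # Sensitivity is the complement of the remaining no-hit mass.
    return 1.0 - sum(states.values())
```

For example, `seed_sensitivity("##-#", 64, 0.7)` gives the hit probability of the seed `##-#` on 64-column alignments with 70% identity under these assumptions. The state space grows as 2^(span - 1), so this sketch is only practical for short seeds.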
Similar resources
inria-00001164, version 1 - 24 Mar 2006. A unifying framework for seed sensitivity and its application to subset seeds (Extended abstract)
We propose a general approach to compute the seed sensitivity, that can be applied to different definitions of seeds. It treats separately three components of the seed sensitivity problem – a set of target alignments, an associated probability distribution, and a seed model – that are specified by distinct finite automata. The approach is then applied to a new concept of subset seeds for which ...
Full text
A unifying framework for seed sensitivity and its application to subset seeds (Extended abstract)
We propose a general approach to compute the seed sensitivity, that can be applied to different definitions of seeds. It treats separately three components of the seed sensitivity problem – a set of target alignments, an associated probability distribution, and a seed model – that are specified by distinct finite automata. The approach is then applied to a new concept of subset seeds for which we pr...
Full text
A Unifying Framework for Seed Sensitivity and Its Application to Subset Seeds
We propose a general approach to compute the seed sensitivity, that can be applied to different definitions of seeds. It treats separately three components of the seed sensitivity problem--a set of target alignments, an associated probability distribution, and a seed model--that are specified by distinct finite automata. The approach is then applied to a new concept of subset seeds for which we...
Full text
Computing Alignment Seed Sensitivity with Probabilistic Arithmetic Automata
Heuristic sequence alignment and database search algorithms, such as PatternHunter and BLAST, are based on the initial discovery of so-called alignment seeds of well-conserved alignment patterns, which are subsequently extended to full local alignments. In recent years, the theory of classical seeds (matching contiguous q-grams) has been extended to spaced seeds, which allow mismatches within a...
Full text
Evaluation of the effect of maternal Soybean (Glycine max) nutrition on seed quality traits under accelerated aging test
Extended Abstract Introduction: Structural and physiological delicacy of soybean seeds is known as an important quality indicator in the cultivation of this plant, but at the same time, the most chronic problems of soybean seed quality are the reduction of seed quality during storage and before sowing. The effect of some nutrients on the quality of soybean seeds under accelerated aging stress ...
Full text