MULTIDIMENSIONAL LONGEST COMMON SUBSEQUENCE DISCOVERY From LARGE DATABASE USING DNA OPERATIONS
نویسنده
چکیده
The problem of analysis of biological sequences, is the discovery of sequence similarity of various kinds, in the primary structure of related proteins and genes. This sequence search can be applied to various applications like discovery of association rules, strong rules, correlations, sequential rules, frequent episodes, multidimensional patterns and many other important discovery tasks. In this paper we have proposed two new approaches to find multidimensional LCS and SCS, of N sequences parallely, using DNA operations. These approaches can be used to find MLCS and MSCS, of any window size, from any number of sequences, and from any type of input data. The proposed work can be applied to finding diverging patterns, constraint MLCS, and many more related patterns Implementation results shown the correctness of the algorithms. Finally, the validity of the algorithms are checked and their time complexity is analyzed.
منابع مشابه
DISCOVERY of LONGEST INCREASING SUBSEQUENCES and its VARIANTS using DNA OPERATIONS
The Longest Increasing Subsequence (LIS) and Common Longest Increasing Subsequence (CLIS) have their importance in many data mining applications. We propose algorithms to discover LIS and CLIS from varied databases. This work finds all increasing subsequences from the given database, find increasing subsequences in n sliding window, longest increasing sequences in one and more sequences, decrea...
متن کاملSolving Longest Common Subsequence Problem with Memetic Algorithms
Pattern discovery in unaligned DNA sequences is a challenge problem. A pattern is some specific nucleotide combination that it can be used to measure the similarity degree among biological sequences. The longest common subsequence (LCS) can be viewed as a pattern discovery problem and it is also a well-known NP-hard problem. In this paper, we present a memetic algorithm-based approach to solve ...
متن کاملSolving the Longest Common Subsequence (LCS) Problem using the Associative ASC Processor with Reconfigurable 2D Mesh
Solving the Longest Common Subsequence (LCS) Problem using the Associative ASC Processor with Reconfigurable 2D Mesh Virdi Sabegh Singh, Hong Wang, Robert A. Walker Computer Science Department, Kent State University, Kent OH 44242 USA Abstract As new genes are sequenced, it is necessary for molecular biologists to compare the new gene’s biosequence to known sequences. One simple form of DNA seq...
متن کاملA Specialized Branching and Fathoming Technique for The Longest Common Subsequence Problem
Given a set S = {S1, ..., Sk} of finite strings, the k-longest common subsequence problem (k-LCSP) seeks a string L of maximum length such that L is a subsequence of each Si for i = 1, ..., k. This paper presents a technique, specialized branching, that solves k-LCSP. Specialized branching combines the benefits of both dynamic programming and branch and bound to reduce the search space. For la...
متن کاملA Dynamic Algorithm for the Longest Common Subsequence Problem using Ant Colony Optimization Technique
We present a dynamic algorithm for solving the Longest Common Subsequence Problem using Ant Colony Optimization Technique. The Ant Colony Optimization Technique has been applied to solve many problems in Optimization Theory, Machine Learning and Telecommunication Networks etc. In particular, application of this theory in NP-Hard Problems has a remarkable significance. Given two strings n a a A ...
متن کامل