Efficient string-matching allowing for non-overlapping inversions
نویسندگان
چکیده
Inversions are a class of chromosomal mutations, widely regarded as one of the major mechanisms for reorganizing the genome. In this paper we present a new algorithm for the approximate string matching problem allowing for non-overlapping inversions which runs in O(nm) worst-case time and O(m2) space, for a character sequence of size n and pattern of size m. This improves upon a previous O(nm2)-time algorithm. In additionwe present a variant of our algorithmwith the same complexity in theworst case, but with a O(n) time complexity in the average case. © 2012 Published by Elsevier B.V.
منابع مشابه
Efficient Matching of Biological Sequences Allowing for Non-overlapping Inversions
Inversions are a class of chromosomal mutations, widely regarded as one of the major mechanisms for reorganizing the genome. In this paper we present a new algorithm for the approximate string matching problem allowing for non-overlapping inversions which runs in O(nm) worst-case time and O(m)-space, for a character sequence of size n and pattern of size m. This improves upon a previous O(nm)-t...
متن کاملApproximate String Matching Allowing for Inversions and Translocations
The approximate string matching problem consists in finding all locations at which a pattern P of length m matches a substring of a text T of length n, after a given finite number of edit operations. In this paper we investigate such problem when the string distance involves translocations of equal length adjacent factors and inversions of factors. In particular, we devise a O(nmmax(α, β))-time...
متن کاملApproximate String Matching for Geographic Names and Personal Names
The problem of matching strings allowing errors has recently gained importance, considering the increasing volume of online textual data. In geotechnologies, approximate string matching algorithms find many applications, such as gazetteers, address matching, and geographic information retrieval. This paper presents a novel method for approximate string matching, developed for the recognition of...
متن کاملText searching allowing for inversions and translocations of factors
The approximate string matching problem consists in finding all locations at which a pattern p of length m matches a substring of a text t of length n, after a finite number of given edit operations. In this paper,we investigate such a problemwhen the edit operations are translocations of adjacent factors of equal length and inversions of factors. In particular, we first present anO(nmmax(α, β)...
متن کاملString Matching with Inversions and Translocations in Linear Average Time (Most of the Time)
We present an efficient algorithm for finding all approximate occurrences of a given pattern p of length m in a text t of length n allowing for translocations of equal length adjacent factors and inversions of factors. The algorithm is based on an efficient filtering method and has an O(nmmax(α, β))-time complexity in the worst case and O(max(α, β))space complexity, where α and β are respective...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Theor. Comput. Sci.
دوره 483 شماره
صفحات -
تاریخ انتشار 2013