Online Construction of Subsequence Automata for Multiple Texts

نویسندگان

  • Hiromasa Hoshino
  • Ayumi Shinohara
  • Masayuki Takeda
  • Setsuo Arikawa
چکیده

We consider a deterministic finite automaton which accepts all subsequences of a set of texts, called subsequence automaton. We show an online algorithm for constructing subsequence automaton for a set of texts. It runs in O(|Σ|(m+ k) +N) time using O(|Σ|m) space, where |Σ| is the size of alphabet, m is the size of the resulting subsequence automaton, k is the number of texts, and N is the total length of texts. It can be used to preprocess a given set S of texts in such a way that for any query w ∈ Σ∗, returns in O(|w|) time the number of texts in S which contain w as a subsequence. We also show an upper bound of the size of automaton compared to the minimum automaton.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Online construction of subsequence automata for multiple texts by Hiromasa Hoshino , Ayumi Shinohara , Masayuki

We consider a deterministic finite automaton which accepts all subsequences of a set of texts, called subsequence automaton. We show an online algorithm for constructing subsequence automaton for a set of texts. It runs in O(|Σ|m) time using O(|Σ|m) space, where m is the number of states of the subsequence automaton, and |Σ| is the size of alphabet. For k texts of length n, since m = O(n), it r...

متن کامل

Online construction of subsequence automata for multiple texts by Hiromasa Hoshino

We consider a deterministic finite automaton which accepts all subsequences of a set of texts, called subsequence automaton. We show an online algorithm for constructing subsequence automaton for a set of texts. It runs in O(|Σ|(m + k) + N) time using O(|Σ|m) space, where |Σ| is the size of alphabet, m is the size of the resulting subsequence automaton, k is the number of texts, N is the total ...

متن کامل

Subsequence Automata with Default Transitions

Let S be a string of length n with characters from an alphabet of size σ. The subsequence automaton of S (often called the directed acyclic subsequence graph) is the minimal deterministic finite automaton accepting all subsequences of S. A straightforward construction shows that the size (number of states and transitions) of the subsequence automaton is O(nσ) and that this bound is asymptotical...

متن کامل

On-line & Incremental Update Properties of the Subsequence Automaton

Many works deal with the subsequence matching problem using automata structures. It is to decide, given two sequences s and t, whether s is a subsequence of t. Automata like the Directed Acyclic Subsequence Graph (dasg) or the Subsequence Automaton (sa) accept all subsequences of a set of texts. We focus on this last structure and provide some useful results upon dynamically updates of the sa. ...

متن کامل

Finding Frequent Subsequences in a Set of Texts

Given a set of strings, the Common Subsequence Automaton accepts all common subsequences of these strings. Such an automaton can be deduced from other automata like the Directed Acyclic Subsequence Graph or the Subsequence Automaton. In this paper, we introduce some new issues in text algorithm on the basis of Common Subsequences related problems. Firstly, we make an overview of different exist...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000