Inferring an indeterminate string from a prefix graph

Authors

  • Ali Alatabbi
  • Mohammad Sohel Rahman
  • William F. Smyth
Abstract

An indeterminate string (or, more simply, just a string) x = x[1..n] on an alphabet Σ is a sequence of nonempty subsets of Σ. We say that x[i₁] and x[i₂] match (written x[i₁] ≈ x[i₂]) if and only if x[i₁] ∩ x[i₂] ≠ ∅. A feasible array is an array y = y[1..n] of integers such that y[1] = n and, for every i ∈ 2..n, y[i] ∈ 0..n−i+1. A prefix table of a string x is an array π = π[1..n] of integers such that, for every i ∈ 1..n, π[i] = j if and only if x[i..i+j−1] is the longest substring at position i of x that matches a prefix of x. It is known from [CRSW14] that every feasible array is a prefix table of some indeterminate string. A prefix graph P = P_y is a labelled simple graph whose structure is determined by a feasible array y. In this paper we show, given a feasible array y, how to use P_y to construct a lexicographically least indeterminate string on a minimum alphabet whose prefix table π = y.
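The definitions in the abstract can be illustrated with a minimal Python sketch. It covers only the forward direction (computing the prefix table of a given indeterminate string and checking feasibility of an array), not the paper's prefix-graph construction. The function names `prefix_table` and `is_feasible` are hypothetical, the string is modelled as a list of Python sets, and indices are 0-based, so the bound y[i] ≤ n−i+1 becomes `y[k] <= n - k`.

```python
def prefix_table(x):
    """Naive O(n^2) prefix table of an indeterminate string.

    x is a list of nonempty sets of letters (an assumption of this sketch).
    Two positions match when their sets intersect.
    """
    n = len(x)
    pi = []
    for i in range(n):
        j = 0
        # Extend while position i+j matches position j of the prefix.
        while i + j < n and x[i + j] & x[j]:
            j += 1
        pi.append(j)
    return pi


def is_feasible(y):
    """Check the feasibility condition, translated to 0-based indices."""
    n = len(y)
    return n > 0 and y[0] == n and all(0 <= y[k] <= n - k for k in range(1, n))


# A regular string "aab" and an indeterminate string a{a,b}b.
regular = [{"a"}, {"a"}, {"b"}]
indeterminate = [{"a"}, {"a", "b"}, {"b"}]

print(prefix_table(regular))        # [3, 1, 0]
print(prefix_table(indeterminate))  # [3, 2, 0]
print(is_feasible(prefix_table(indeterminate)))  # True
```

In the second example the middle position {a, b} matches both {a} and {b}, although {a} and {b} do not match each other, which shows that matching of indeterminate letters is not transitive.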

Similar resources

Indeterminate strings, prefix arrays & undirected graphs

An integer array y = y[1..n] is said to be feasible if and only if y[1] = n and, for every i ∈ 2..n, i ≤ i+y[i] ≤ n+1. A string is said to be indeterminate if and only if at least one of its elements is a subset of cardinality greater than one of a given alphabet Σ; otherwise it is said to be regular. A feasible array y is said to be regular if and only if it is the prefix array of some regular...

Enhanced Covers of Regular and Indeterminate Strings Using Prefix Tables

A cover of a string x = x[1..n] is a proper substring u of x such that x can be constructed from possibly overlapping instances of u. A recent paper [12] relaxes this definition — an enhanced cover u of x is a border of x (that is, a proper prefix that is also a suffix) that covers a maximum number of positions in x (not necessarily all) — and proposes efficient algorithms for the computation o...

String Inference from Longest-Common-Prefix Array

The suffix array, perhaps the most important data structure in modern string processing, is often augmented with the longest common prefix (LCP) array which stores the lengths of the LCPs for lexicographically adjacent suffixes of a string. Together the two arrays are roughly equivalent to the suffix tree with the LCP array representing the tree shape. In order to better understand the combinat...

Enhanced Covers of Regular & Indeterminate Strings using Prefix Tables

A cover of a string x = x[1..n] is a proper substring u of x such that x can be constructed from possibly overlapping instances of u. A recent paper [12] relaxes this definition — an enhanced cover u of x is a border of x (that is, a proper prefix that is also a suffix) that covers a maximum number of positions in x (not necessarily all) — and proposes efficient algorithms for the computation o...

Computing Covers Using Prefix Tables

An indeterminate string x = x[1..n] on an alphabet Σ is a sequence of nonempty subsets of Σ; x is said to be regular if every subset is of size one. A proper substring u of regular x is said to be a cover of x iff for every i ∈ 1..n, an occurrence of u in x includes x[i]. The cover array γ = γ[1..n] of x is an integer array such that γ[i] is the longest cover of x[1..i]. Fifteen years ago a com...

Journal:
  • J. Discrete Algorithms

Volume 32

Publication date: 2015