DNA-words and word posets

نویسندگان

  • Péter Ligeti
  • Péter Sziklai
چکیده

In the paper two variants of a combinatorial problem for the set F q of sequences of length n over the alphabet Fq = {0, 1, .., q − 1} are considered, with some applications. The original problem was the following: what is the smallest k such that every word v ∈ F q is uniquely determined by the set of its subwords of length up to k. This problem was solved by Lothaire [1]. We consider the following variant of this problem: the n-letter word w = w1...wn (which is called a DNA-word) is composed over an alphabet consisting of q complement pairs:{i, ī : i = 0, .., q− 1}; and denote by w its reverse complement, i.e. w = w̄n...w̄1. A DNA-word u is called a subword of w if it is a subword of either w or w. As above, we’re looking for the smallest k. We give a simple proof for k = n − 1, and apply this result for determining the automorphism group of the poset of DNA-words of length at most n, partially ordered by the above subword relation. Furthermore we give a sharp result k ∼ 2n/3, which is an analogue of the former result [1].

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Finite Word Poset

Our word posets have finite words of bounded length as their elements, with the words composed from a finite alphabet. Their partial ordering follows from the inclusion of a word as a subsequence of another word. The elemental combinatorial properties of such posets are established. Their automorphism groups are determined (along with similar result for the word poset studied by Burosch, Frank ...

متن کامل

Algorithmics of Posets Generated by Words Over Partially Commutative Alphabets (Extended)

It is natural to relate partially ordered sets (posets in short) and classes of equivalent words over partially commutative alphabets. Their common graphical representation are Hasse diagrams. We investigate this relation in detail and propose an efficient online algorithm that decompresses a concurrent word to its Hasse diagram. The lexicographically minimal representative of a trace (an equiv...

متن کامل

Word posets, with applications to Coxeter groups

We discuss the theory of certain partially ordered sets that capture the structure of commutation classes of words in monoids. As a first application, it follows readily that counting words in commutation classes is #P-complete. We then apply the partially ordered sets to Coxeter groups. Some results are a proof that enumerating the reduced words of elements of Coxeter groups is #P-complete, a ...

متن کامل

Double homotopy Cohen-Macaulayness for the poset of injective words and the classical NC-partition lattice

In this paper we study topological properties of the poset of injective words and the lattice of classical non-crossing partitions. Specifically, it is shown that after the removal of the bottom and top elements (if existent) these posets are doubly Cohen-Macaulay. This extends the well-known result that those posets are shellable. Both results rely on a new poset fiber theorem, for doubly homo...

متن کامل

Presenting Finite Posets

String rewriting systems have been originally introduced by Thue [21] in order to study word problems in monoids. A string rewriting system (Σ,R) consists of a set Σ, called the alphabet, and a set R ⊆ Σ∗×Σ∗ of rules. The monoid Σ∗/≡R, obtained by quotienting the free monoid Σ∗ over Σ by the smallest congruence (wrt concatenation) containing R, is called the monoid presented by the rewriting sy...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008