Compressed Index for a Dynamic Collection of Texts
نویسندگان
چکیده
Let T be a string with n characters over an alphabet of bounded size. The recent breakthrough on compressed indexing allows us to build an index for T in optimal space (i.e., O(n) bits), while supporting very efficient pattern matching [2, 4]. This paper extends the work on optimal-space indexing to a dynamic collection of texts. Precisely, we give a compressed index using O(n) bits where n is the total length of texts, such that searching for a pattern P takes O(|P | logn+ occ log n) time where occ is the number of occurrences, and inserting or deleting a text T takes O(|T | logn) time.
منابع مشابه
Dynamic FM-Index for a Collection of Texts with Application to Space-efficient Construction of the Compressed Suffix Array
متن کامل
A compressed dynamic self-index for highly repetitive text collections
We present a novel compressed dynamic self-index for highly repetitive text collections. Signature encoding, an existing self-index of this type, has a large disadvantage of slow pattern search for short patterns. We obtain faster pattern search by leveraging the idea behind a truncated suffix tree (TST) to develop the first compressed dynamic self-index, called the TST-index, that supports not...
متن کاملSelf-Index based on LZ77 (thesis)
Domains like bioinformatics, version control systems, collaborative editing systems (wiki), and others, are producing huge data collections that are very repetitive. That is, there are few differences between the elements of the collection. This fact makes the compressibility of the collection extremely high. For example, a collection with all different versions of a Wikipedia article can be co...
متن کاملDynamic Modelling of a Compressed Air Energy Storage System in a Grid Connected Photovoltaic Plant
The use of photovoltaic (PV) cells in domestic and industrial applications has grown rapidly through the recent years. Constructing PV plants is a very smart measure to produce free electricity in large scales, especially in the countries with higher solar irradiation potential. On the other hand, compressed air energy storage (CAES) has already been proposed to be employed for energy storage a...
متن کاملIndexing Compressed Text
As a result of the rapid growth of the volume of electronic data, text compression and indexing techniques are receiving more and more attention. These two issues are usually treated as independent problems, but approaches of combining them have recently attracted the attention of researchers. In this thesis, we review and test some of the more effective and some of the more theoretically inter...
متن کامل