On variants of block-sorting compression using context from both the
نویسندگان
چکیده
The block-sorting text compression algorithm can be viewed as associating a context with each character to be compressed, and then sorting the characters on their contexts. Normally, the context associated with each character is the string to the left of the character. Recently, Ratushnyak suggested that it might be possible instead to build a context by interleaving characters taken alternately from the left and right. We show that transformations of this type are not reversible in general unless additional information is supplied. Further, the amount of additional information needed to reverse the transformation is necessarily large, and so such transformations are unlikely to be of interest as part of a compression algorithm.
منابع مشابه
On variants of block-sorting compression using context from both the left and right
The block-sorting text compression algorithm can be viewed as associating a context with each character to be compressed, and then sorting the characters on their contexts. Normally, the context associated with each character is the string to the left of the character. Recently, Ratushnyak suggested that it might be possible instead to build a context by interleaving characters taken alternatel...
متن کاملText Compression using Recency Rank with Context and Relation to Context Sorting, Block Sorting and PPM*
Recently block sorting compression scheme was developed and relation to statistical scheme was studied, but theoretical analysis of performance has not been studied well. Context sorting is a compression scheme based on context similarity and it is regarded as an online version of the block sorting and it is asymptotically optimal. However, the compression speed is slower and the real performan...
متن کاملImprovements to the Block Sorting Text Compression Algorithm
This report presents some further work on the recently described “Block Sorting” lossless or text compression algorithm. It is already known that it is a context-based compressor of unbounded order, but those contexts are completely restructured by the sort phase of the compression. The report examines the effects of those context changes. It is shown that the requirements on the final compress...
متن کاملThe Context Trees of Block Sorting Compression
The Burrows-Wheeler transform (BWT) and block sorting compression are closely related to the context trees of PPM. The usual approach of treating BWT as merely a permutation is not able to fully exploit this relation. We show that an explicit context tree for BWT can be efficiently generating by taking a subset of the corresponding suffix tree, identify the central problems in exploiting its st...
متن کاملGender-oriented Commonalities among Canadian and Iranian Englishes: An Analysis of Yes/No Question Variants
This study investigatesvariability in English yes/no questions as well as the commonalities among yes/no question variants produced by members of two different varieties of English: Canadian English native speakers and Iranian EFL learners.Further, it probes the role of gender in theEnglish yes/no question variants produced by Canadian English native speakers and those produced by Iranian EFL l...
متن کامل