Compression of Compound Documents

نویسنده

  • Ricardo L. de Queiroz
چکیده

Compound (or mixed) document images contain graphic or textual content along with pictures. They are a very common form of documents, found in magazines, brochures, web-sites etc. Because of the very distinct nature of those two image classes (text/graphics vs. pictures), their compression invariably involves multiple compression systems and a region segmentation (classification) method. We review state-of-the-art technologies on the subject while focusing our attention on the mixed raster content (MRC) multi-layer approach. We also present new results on segmentation for MRC based on optimized rate-distortion-based block thresholding.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MRC Compression of Compound Documents using H.264/AVC-I

The Mixed Raster Content (MRC) ITU document compression standard (T.44) specifies a multi-layer multiresolution representation of a compound document. It is expected that higher compression can be achieved if more efficient compression standards are used to compress each layer. In this paper we present an MRC compound document codec that uses the H.264/AVC operating in INTRA mode to encode back...

متن کامل

Low complexity guaranteed fit compound document compression

We propose a new, very low complexity, single-pass, algorithm for compression of continuous tone compound documents, known as GRAFIT (GuaRAnteed FIT) that can guarantee a minimum compression ratio of as much as 12:1 and even more, for all images in a single pass, while maintaining visually lossless quality when reproduced at resolution 300 dpi or more. The compression ratio is guaranteed in a s...

متن کامل

Comparison of H.264/Avc-Intra Technique for Compound and Natural Image Compression

Currently, the notion of paperless office is being promoted as part of eco-projects in many industries, where paper documents are converted into electronic documents. These images are termed as ‘Compound images’ and are defined as images that contain a combination of text, natural (photo) images and graphic images. The number of documents stored in electronic format is increasing enormously and...

متن کامل

JPEG2000-matched MRC compression of compound documents

The Mixed Raster Content (MRC) ITU document compression standard (T.44) specifies a multilayer decomposition model for compound documents into two contone image layers and a binary mask layer for independent compression. While T.44 does not recommend any procedure for decomposition, it does specify a set of allowable layer codecs to be used after decomposition. While T.44 only allows older stan...

متن کامل

Lossless Compression for Compound Documents Based on Block Classification

Image and video compressions are required to reduce the number of bits needed to represent the content of the original data. Compression of scanned or compound documents and images can be more difficult than the original data because it is a mixture of text, picture and graphics. The main requirement of the compound document or images is quality of the decompressed data. Here Quality is defined...

متن کامل

Document Compression Using H.264/AVC

It has been verified that H.264/AVC, the newest video compression standard, can also be used to encode still images. In many cases, it outperforms state-of-art coders such as JPEG2000. For compound documents, the gains over JPEG2000 are even more expressive. In this scenario, the contributions of the present paper are distributed over four document encoding methods that use the H.264/AVC as a b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999