Efficient document rendering with enhanced run length encoding

Authors

  • Guotong Feng
  • Charles A. Bouman
Abstract

Document imaging and transmission systems (typically MFPs) require image rendering methods that are both effective and efficient, that support standard data formats for a variety of document types, and that allow real-time implementation. Since most conventional raster formats (e.g. TIFF, PDF, JPEG) are designed for either black-and-white text or continuous-tone images, more specialized rendering methods are often required to represent mixed-content documents. The baseline TIFF format supports a few binary compression options: PackBits, CCITT G3, and CCITT G4. Conventionally, halftoning algorithms such as error diffusion can be used to create a binary representation of a document image in the TIFF format. However, PackBits, CCITT G3, and CCITT G4 generally do not achieve the desired compression ratios on halftone images. In this paper, we propose an efficient error diffusion algorithm optimized for PackBits compression. This method, which we refer to as POED (PackBits optimized error diffusion), is a form of threshold modulation error diffusion that takes advantage of the byte-oriented run-length structure of PackBits compression by encouraging repetition of bytes in the resulting binary image. To maintain the sharpness of text, a binary segmentation algorithm switches off the adaptive error diffusion procedure and switches on the Floyd-Steinberg error diffusion procedure in text regions. The POED method with PackBits compression yields higher compression ratios than the conventional error diffusion method, while maintaining desirable visual quality with low computational and memory requirements. We present experimental results comparing our method with the Floyd-Steinberg error diffusion method.
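
To illustrate why byte repetition matters for PackBits, the following is a minimal sketch of a PackBits encoder and decoder written from the commonly documented header convention (header 0 to 127: copy the next header+1 bytes literally; header 129 to 255: repeat the next byte 257-header times; 128: no-op). It is not the authors' POED implementation; the function names `packbits_encode` and `packbits_decode` are chosen here for illustration only.

```python
def packbits_encode(data: bytes) -> bytes:
    """Encode bytes with the PackBits run-length scheme (illustrative sketch)."""
    out = bytearray()
    i, n = 0, len(data)
    while i < n:
        # Measure the run of identical bytes starting at i (at most 128).
        run = 1
        while i + run < n and run < 128 and data[i + run] == data[i]:
            run += 1
        if run >= 2:
            # Replicate run: header 257 - run is the unsigned form of -(run - 1).
            out.append(257 - run)
            out.append(data[i])
            i += run
        else:
            # Literal run: collect bytes until a repeat starts or 128 are gathered.
            start = i
            i += 1
            while (i < n and i - start < 128
                   and not (i + 1 < n and data[i + 1] == data[i])):
                i += 1
            out.append(i - start - 1)
            out.extend(data[start:i])
    return bytes(out)


def packbits_decode(data: bytes) -> bytes:
    """Invert packbits_encode."""
    out = bytearray()
    i = 0
    while i < len(data):
        header = data[i]
        i += 1
        if header < 128:
            # Literal run of header + 1 bytes.
            out.extend(data[i:i + header + 1])
            i += header + 1
        elif header > 128:
            # Replicate the next byte 257 - header times.
            out.extend(data[i:i + 1] * (257 - header))
            i += 1
        # header == 128 is a no-op.
    return bytes(out)


# 64 identical bytes (a repeating halftone byte pattern) versus 64 distinct bytes.
repeated = b"\xAA" * 64
varied = bytes(range(64))
print(len(packbits_encode(repeated)))  # 2
print(len(packbits_encode(varied)))    # 65
```

In this sketch, a scanline whose packed bytes repeat shrinks from 64 bytes to 2, while a scanline with no repeated bytes grows by one header byte. This is the property the abstract refers to when it says the method encourages repetition of bytes in the binary image.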

Similar resources

Enhanced Constrained Run-Length Algorithm for Complex Layout Document Processing

The Constrained Run-Length Algorithm (CRLA) is a well-known technique for page segmentation. The algorithm is very efficient for partitioning documents with Manhattan layouts but not suited to deal with complex layout pages, e.g. irregular graphics embedded in a text paragraph. Its main drawback is to use only local information during the smearing stage, which may lead to erroneous linkage of t...

TRLE - An Efficient Data Compression Scheme for Image Composition of Parallel Volume Rendering Systems

In this paper, we present an efficient data compression scheme, the template run-length encoding (TRLE) scheme, for image composition of parallel volume rendering systems. Given an image with 2n×2n pixels, in the TRLE scheme, the image is treated as n×n blocks and each block has 2×2 pixels. Since a pixel can be a blank or non-blank pixel, there are 16 templates in a block. To compress an image,...

Efficient Binary and Run Length Morphology and its Application to Document Image Processing

This paper describes the implementation and evaluation of an open source library for mathematical morphology based on packed binary and run-length compressed images for document imaging applications. Abstractions and patterns useful in the implementation of the interval operations are described. A number of benchmarks and comparisons to bit-blit based implementations on standard document images...

An improved binary-swap compositing for sort-last parallel rendering on distributed memory multiprocessors

Sort-last parallel rendering is a good rendering scheme on distributed memory multiprocessors. This paper presents an improvement on the binary-swap (BS) method, which is an efficient image compositing algorithm for sort-last parallel rendering. Our compositing method uses three acceleration techniques, compared to the original BS method: (1) the interleaved splitting, (2) multiple bounding rec...

Template-based rendering of run-length encoded volumes

Template-based volume rendering is an acceleration technique for volume ray casting. It does not trade off image quality for rendering speed. However, it still falls short of interactive manipulation of volume data, mainly due to the ray-by-ray volume access pattern and the long ray path in the transparent regions. In this paper, we present an object-order template-based volume renderi...

Publication date: 2006