Extracting halftones from printed documents using texture analysis

Authors

  • Dennis F. Dunn
  • Thomas P. Weldon
  • William E. Higgins
Abstract

Separating halftones from text is an important step in document analysis. We present an algorithm that accurately extracts halftones from other information in printed documents. We treat halftone extraction as a texture-segmentation problem. We show that commonly used halftones, consisting of a pattern of dots, can be viewed as a texture. This texture exhibits a distinct spectral component that can be detected using a properly tuned Gabor filter. The Gabor filter essentially transforms halftones into high-contrast regions that can be isolated by thresholding. We propose a filter-design procedure and provide experimental results.
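The idea in the abstract (a Gabor filter tuned to the dot frequency, followed by thresholding) can be illustrated with a small sketch. This is not the authors' filter-design procedure, which the abstract does not describe. The synthetic page, the function names, the kernel parameters (freq, theta, sigma, size) and the half-of-maximum threshold are all assumptions chosen for illustration.

```python
import numpy as np

def gabor_kernel(freq, theta, sigma, size):
    """Complex Gabor kernel: Gaussian envelope times a complex sinusoid.
    All parameter values used below are illustrative assumptions."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1]
    xr = x * np.cos(theta) + y * np.sin(theta)   # axis along the carrier
    envelope = np.exp(-(x**2 + y**2) / (2.0 * sigma**2))
    carrier = np.exp(2j * np.pi * freq * xr)
    # Normalise so that the gain at the tuned frequency is 1.
    return envelope * carrier / envelope.sum()

def gabor_magnitude(image, kernel):
    """Magnitude of the filter response via circular FFT convolution."""
    kh, kw = kernel.shape
    padded = np.zeros(image.shape, dtype=complex)
    padded[:kh, :kw] = kernel
    # Move the kernel centre to the origin so the output is not shifted.
    padded = np.roll(padded, (-(kh // 2), -(kw // 2)), axis=(0, 1))
    response = np.fft.ifft2(np.fft.fft2(image) * np.fft.fft2(padded))
    return np.abs(response)

def make_test_page(size=128, period=4):
    """Hypothetical test page: blank left half, dot pattern on the right."""
    y, x = np.mgrid[0:size, 0:size]
    dots = (x % period < period // 2) & (y % period < period // 2)
    page = np.ones((size, size))                 # white paper
    right = x >= size // 2
    page[right & dots] = 0.0                     # black halftone dots
    return page

page = make_test_page()
# Tune the filter to the horizontal dot frequency (1 cycle per 4 pixels).
kernel = gabor_kernel(freq=0.25, theta=0.0, sigma=4.0, size=25)
mag = gabor_magnitude(page - page.mean(), kernel)
# Assumed threshold: half of the peak response.
mask = mag > 0.5 * mag.max()
print("fraction of page marked as halftone:", mask.mean())
```

In this sketch the response magnitude is close to constant inside the dot pattern and near zero on blank paper, so a single global threshold separates the two. A real document would also contain text, whose strokes may excite the filter, so the choice of frequency, bandwidth and threshold would need the kind of design procedure the paper proposes.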

Similar resources

Show-through watermarking of duplex printed documents

A technique for watermarking duplex printed pages is presented. The technique produces visible watermark patterns like conventional watermarks embedded in paper fabric. Watermark information is embedded in halftones used to print images on either side. The watermark pattern is imperceptible when images printed on either side are viewed independently but becomes visible when the sheet of paper i...

Global Approach for Script Identification using Wavelet Packet Based Features

In a multi-script environment, an archive of documents having the text regions printed in different scripts is in practice. For automatic processing of such documents through Optical Character Recognition (OCR), it is necessary to identify different script regions of the document. In this paper, a novel texture-based approach is presented to identify the script type of the collection of documen...

Wavelet Packet Based Texture Features for Automatic Script Identification

In a multi-script environment, an archive of documents printed in different scripts is in practice. For automatic processing of such documents through Optical Character Recognition (OCR), it is necessary to identify the script type of the document. In this paper, a novel texture-based approach is presented to identify the script type of the collection of documents printed in ten Indian scripts ...

Entropy Based Texture Features Useful for Automatic Script Identification

In a multi-script environment, a collection of documents printed in different scripts is in practice. For automatic processing of such documents through Optical Character Recognition, it is necessary to identify the script type of the document. In this paper, a novel texture-based approach is presented to identify the script type of the documents printed in three prioritized scripts Kannada, Hi...

Texture based attacks on intrinsic signature based printer identification

Several methods exist for printer identification from a printed document. We have developed a system that performs printer identification using intrinsic signatures of the printers. Because an intrinsic signature is tied directly to the electromechanical properties of the printer, it is difficult to forge or remove. There are many instances where existence of the intrinsic signature in the prin...

Publication date: 1996