Input sensitive thresholding for ancient Hebrew manuscript

نویسنده

  • Itay Bar Yosef
چکیده

In this paper, we describe an input sensitive thresholding algorithm for ancient Hebrew calligraphy documents. Usually, historical document images are of poor quality since the documents have degraded over time due to storage conditions. However, the distribution of noise in one document is not uniform and the characters quality may vary. We develop tools to identify noisy characters and apply more sophisticated tools to process them. First, we use a global thresholding method to obtain an initial binary image. This suffices for noise free characters. Then we evaluate the document characters and invoke an accurate local method only on the noisy characters. Results show that our method detects a very high percent of the noisy characters, and that the local method achieves very accurate results. 2004 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effective Thresholding of Ancient Degraded Manuscript Folio Images

Thresholding is an essential procedure used in image segmentation and binarization applications. In this paper, segmentation methods applied on document images for separating the text from background presents pure binarization and filtering combined with image processing algorithms. This paper describes a contrast based thresholding method for old degraded manuscript images. It is an approach f...

متن کامل

Isaiah 1:2-20 Scansions with Prosodic Notations

Stichographic arrangements of ancient Hebrew poetry that have come down to us in manuscript tend to dichotomize lines, regardless of whether they consist of two or three versets. This has the advantage of maintaining a consistent arrangement. It has the further effect of emphasizing the major and deemphasizing the minor caesura of a 1:(1:1) or (1:1):1 structure. In the scansions that follow, th...

متن کامل

Comparison of Niblack inspired binarization methods for ancient documents

In this paper, we present a new sliding window based local thresholding technique ‘NICK’ and give a detailed comparison of some existing sliding-window based thresholding algorithms with our method. The proposed method aims at achieving better binarization results, specifically, for ancient document images. NICK has been inspired from the Niblack’s binarization method and exhibits its robustnes...

متن کامل

An Editor of Ancient Texts as Part of the System "Manuscript"

The Information Retrieval System "Manuscript" is intended for storing, editing and processing electronic copies of manuscripts. By retaining all the peculiarities of the ancient treasures, the Manuscript system provides a thorough input of texts/manuscripts under study while preserving the integrity of the original electronic copy of the manuscript, text transcription, and transliteration for t...

متن کامل

Cleaning of Ancient Document Images Using Modified Iterative Global Threshold

Ancient document Image processing is an important area attracting many researchers in the recent period. Binarization is the first step while cleaning the document for further processing. Based on the degradation of the original document, either global or local thresholding methods are preferred. Thresholding phenomenon is a simple and practical approach to identify the cluster of pixels that a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Pattern Recognition Letters

دوره 26  شماره 

صفحات  -

تاریخ انتشار 2005