A Syntactic Omni-Font Character Recognition System
نویسنده
چکیده
This paper introduces a syntactic omni-font character recognition system. The "omnifont" attribute reflects the wide range of fonts that fall within the class of characters that can be recognized. This includes hand-printed characters as well. A structural pattern-matching approach is employed. Essentially, a set of loosely constrained rules specify pattern components and their interrelationships. The robustness of the system is derived from the orthogonal set of pattern descriptors, location functions, and the manner in which they are combined to exploit the topological structure of characters. By virtue of the new pattern description language, POL, developed in this paper, the user may easily write rules to define new patterns for the system to recognize. The system also features scale-invariance and user-definable sensitivity to tilt orientation.
منابع مشابه
Optical Font Recognition from Projection Profiles
• Recognition of logical document structures [1], where knowledge of the font used in a word, line, or text block may be useful for defining its logical label (chapter title, section title or paragraph). • Document reproduction, where knowledge of the font is necessary in order to reproduce (reprint) the document. • Document indexing and information retrieval, where word indexes are generally p...
متن کاملA Robust Free Size OCR for Omni-Font Persian/Arabic Printed Document Using Combined MLP/SVM
Optical character recognition of cursive scripts present a number of challenging problems in both segmentation and recognition processes and this attracts many researches in the field of machine learning. This paper presents a novel approach based on a combination of MLP and SVM to design a trainable OCR for Persian/Arabic cursive documents. The implementation results on a comprehensive databas...
متن کاملOptimum Design Parameters of the Classifiers for Omni-Font Machine-Printed Numeral Recognition Based on the Minimum Classification Error Criterion
Abs t rac t The optimal design parameters of classifiers for omni-font machine-printed numeral recognition based on the minimum classification error (MCE) criterion are determined experimentall y. The design parameters that influence the accuracy of an optical character reader (OCR) are: similarity measure (or distance measure), kinds of features, dimension of the feature vector, method of trai...
متن کاملFont Recognition of Chinese Character Based on Multi-Scale Wavelet
Optical character recognition system research has been acquired howling success, but the reconstruction of layout needs fonts of the characters. In this paper, a novel font recognition algorithm is proposed, which is based on multi-scale wavelet analysis. We adopt wavelet analysis and the grid method to deal with the character image, and extract wavelet energy density feature, and apply the BP ...
متن کاملOptical Font Recognition for Multi-Font OCR and Document Processing
In this paper we present a Multi-font OCR system to be employed for document processing, which performs, at the same time, both the character recognition and the font-style detection of the digits belonging to a subset of the existing fonts. The detection of the font-style of the document words can guide a rough automatic classification of documents, and can also be used to improve the characte...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IJPRAI
دوره 1 شماره
صفحات -
تاریخ انتشار 1987