Evaluation of Holistic Features for Word Recognition

نویسندگان

  • José Josemar de Oliveira
  • João Marques de Carvalho
چکیده

This work proposes a testing environment for evaluation of holistic methods for word recognition. The main goal is to determine an optimun set of holistic characteristics to represent the months of the year in Brazilian Portuguese language. The holistic characteristics treat the words as single, indivisible entities and attempts to perform recognition from their overall shape, as opposed to their character contents. No segmentation step is needed in holistic recognition systems. 1 Relevance of holistic methods Holistic approaches circumvent problems of segmentation ambiguity and variability of segment shape, because they make no attempt to segment the word into subunits. Instead, they rely on word-level features and matching to determine the identity of the word. 1.1 Choice of database In recognition of handwritten bankchecks, one of the problems is to recognize the date field, that is formed by words (cities and months) and numerical characters. The task of recognizing the name of the month is adequated for holistic application, since that involves a small lexicon. There is consensus that the utility of the holistic approach is either in the small lexicon cases or in the filtering of large lexicons. 2 Description of testing environment The definition of an optimum feature set is important since there is no standard set of representative characteristics for the task of handwriting recognition. The testing environment proposed is presented in Figure 1. The main function is to analyse the different characteristic sets in order to perform their validation. The fundamental parts of the system are: Image Samples of words obtained from specific forms that been digitilized by a scanner dispositive. Figure 2 ilustrates some samples already obtained. A database with 6000 images has been constructed to be used in this work; Pre-processing Different methods applied to the image to perform noise elimination and slant normalization; Characteristics Extractor Different methods that extract the holistic characteristics of the words, such as, estimated word length, number and position of ascenders and descenders; Neural Classificator Neural network (NN) atributes a confidence measure to the image. For each characteristic set, the NN is trained and tested. The best set is the one that maximizes recognition rate. Characteristics Extractor Neural Classificator Image Pre−processing

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Holistic Farsi handwritten word recognition using gradient features

In this paper we address the issue of recognizing Farsi handwritten words. Two types of gradient features are extracted from a sliding vertical stripe which sweeps across a word image. These are directional and intensity gradient features. The feature vector extracted from each stripe is then coded using the Self Organizing Map (SOM). In this method each word is modeled using the discrete Hidde...

متن کامل

Top-Down Likelihood Word Image Generation Model for Holistic Word Recognition

This paper describes a new top-down word image generation model for word recognition. This model can generate a word image with a likelihood based on linguistic knowledge, segmentation and character image. In the recognition process, first, the model generates the word image which approximates an input image best for each of a dictionary of possible words. Next, the model calculates the distanc...

متن کامل

The Role of Holistic Paradigms in Handwritten Word Recognition

ÐThe Holistic paradigm in handwritten word recognition treats the word as a single, indivisible entity and attempts to recognize words from their overall shape, as opposed to their character contents. In this survey, we have attempted to take a fresh look at the potential role of the Holistic paradigm in handwritten word recognition. The survey begins with an overview of studies of reading whic...

متن کامل

A Holistic Approach for Handwritten Hindi Word Recognition

Holistic word recognition attempts to recognize the entire word image as a single pattern. In general, it performs better than segmentation based word recognition model for known, fixed and small sized lexicon. The present work deals with recognition of handwritten words in Hindi in holistic way. Features like area, aspect ratio, density, pixel ratio, longest run, centroid and projection length...

متن کامل

Holistic Verification of Handwritten Phrases

ÐIn this paper, we describe a system for rapid verification of unconstrained off-line handwritten phrases using perceptual holistic features of the handwritten phrase image. The system is used to verify handwritten street names automatically extracted from live U.S. mail against recognition results of analytical classifiers. Presented with a binary image of a street name and an ASCII street nam...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001