Cursive Digit and Character Recognition on Cedar Database
نویسندگان
چکیده
This paper uses a modified Hough transform method to extract features from the cursive handwritten digit and characters CEDAR data. The technique does not require the detection of complex structural primitives such as loops etc. The handwriting images are divided into uniform regions that are analysed for the presence of horizontal, vertical and diagonal segments. The total number of such segments found in these regions are used as input to a linear classifier (discriminant analysis) and a non-linear classifier (nearest neighbour). The results are produced on the complete test sets specified in CEDAR database as well as a leave-one-out crossvalidation is performed. On the digit data, the results show a recognition rate of around 94% correct recognition on the test set and 87.5% using a leave-oneout method. Character recognition results range between 67% correct on test set and 64% correct using leave-oneout cross-validation.
منابع مشابه
Handwritten character recognition through two-stage foreground sub-sampling
In this paper, we present a methodology for off-line handwritten character recognition. The proposed methodology relies on a new feature extraction technique based on recursive subdivisions of the character image so that the resulting sub-images at each iteration have balanced (approximately equal) numbers of foreground pixels, as far as this is possible. Feature extraction is followed by a two...
متن کاملCursive Character Challenge: a New Database for Machine Learning and Pattern Recognition
Cursive character recognition is a challenging task due to high variability and intrinsic ambiguity of cursive letters. This paper presents C-Cube (Cursive Character Challenge), a new public-domain cursive character database. C-Cube contains 57293 cursive characters manually extracted from cursive handwritten words, including both upper and lower case versions of each letter. The database can b...
متن کاملCursive character recognition by learning vector quantization
This paper presents a cursive character recognizer embedded in an o-line cursive script recognition system. The recognizer is composed of two modules: the ®rst one is a feature extractor, the second one a learning vector quantizer. The selected feature set was compared to Zernike polynomials using the same classi®er. Experiments are reported on a database of about 49,000 isolated characters.
متن کاملCursive Script Postal Address Recognition Abstract Cursive Script Postal Address Recognition
Cursive Script Postal Address Recognition By Prasun Sinha Large variations in writing styles and di culty in segmenting cursive words are the main reasons for cursive script postal address recognition being a challenging task A scheme for locating and recognizing words based on over segmentation followed by dynamic programming is proposed This technique is being used for zip code extraction as ...
متن کاملA New Arabic (ahd/amsh) Handwritten Database
This paper introduces new database for Arabic handwritten words. The Arabic handwritten database (AHD/AMSH) represents a utility to facilitate the experiments of the character recognition algorithms. It contains three types of images: word, isolated character, and digit images. The AHD/AMSH can be used for baseline detection, characters segmentation, normalization, thinning, training and testin...
متن کامل