The use of Radon Transform in Handwritten Arabic (Indian) Numerals Recognition
نویسندگان
چکیده
This paper describes a technique for the recognition of off-line handwritten Arabic (Indian) numerals using Radon and Fourier Transforms. Radon-Fourier-based features are used to represent Arabic digits. Nearest Mean Classifier (NMC), K-Nearest Neighbor Classifier (K-NNC), and Hidden Markov Models Classifier (HMMC) are used. Analysis using different number of projections, varying the number of Radonbased features, and the number of samples used in the training and testing of this technique is presented using the NMC and K-NNC. A database of 44 writers with 48 samples per digit each totaling 21120 samples are used for training and testing of this technique. The training and testing of the HMMC is different than that of the NMC and KNNC in its internal working and in the way data is presented to the classifier. Since the digits have equal probability the randomization of the digits is necessary in the training of the HMMC. 80% of the data was used in training and the remaining 20% in testing of the HMMC. Radon-based features are extracted from Arabic numerals and used in training and testing of the HMM. In this work we didn’t follow the general trend, in HMMC, of using sliding windows in the direction of the writing line to generate features. Instead we generated features based on the digit as a unit. Several experiments were conducted for estimating the suitable number of states for the HMM. In addition, we experimented with different number of observations per digit. The Radon-Fourier-based features proved to be simple and effective. The classification errors were analyzed. The majority of errors were due to the misclassification of digit 7 with 8 and vice versa. Hence, a second Structural Classifier is used in a cascaded (second) stage for the NMC, K-NNC, and HMMC. This stage, which is based on the structural attributes of the digits, enhanced the average overall recognition rate from 3.1% to 4.05% (Recognition rates of 98.66%, 98.33%, 97.1% for NMC, K-NNC, HMMC, respectively). Key-Words: Arabic numeral recognition, OCR, Hidden Markov Models, Handwritten Digit recognition, Nearest neighbor classifier.
منابع مشابه
Recognition of Handwritten Arabic (Indian) Numerals using Radon- Fourier-based Features
This paper describes a technique for the recognition of off-line handwritten Arabic (Indian) numerals using Radon-Fourier-based features. A two stage classification scheme is used. The Nearest Mean (NMC), K-Nearest Neighbor (K-NNC), and Hidden Markov Models (HMMC) Classifiers are used in the first stage and a Structural Classifier (SC) is used in the second stage. A database of 44 writers with ...
متن کاملOff-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملIsolated Handwritten Roman Numerals Recognition Using Methods Based on Radon, Hough Transforms and Gabor Filter
This paper presents for isolated handwritten Roman numerals recognition a research interested for carrying out both comparisons between the performances in terms of precision and rapidity, the first comparison is realized between four hybrid methods used to extract the features from numerals that are the zoning combined with Radon transform in first time, then combined with Hough transform in s...
متن کاملUse of the Shearlet Transform and Transfer Learning in Offline Handwritten Signature Verification and Recognition
Despite the growing growth of technology, handwritten signature has been selected as the first option between biometrics by users. In this paper, a new methodology for offline handwritten signature verification and recognition based on the Shearlet transform and transfer learning is proposed. Since, a large percentage of handwritten signatures are composed of curves and the performance of a sig...
متن کاملAutomatic Recognition of Off-line Handwritten Arabic (Indian) Numerals Using Support Vector and Extreme Learning Machines
This paper describes a technique using Support Vector (SVM) and Extreme Learning Machines (ELM) for automatic recognition of off-line handwritten Arabic (Indian) numerals. The features of angle, distance, horizontal, and vertical span are extracted from these numerals. The database has 44 writers with 48 samples of each digit totaling 21120 samples. A two-stage exhaustive parameter estimation t...
متن کامل