Feature Selection and Model Design through GA Applied to Handwritten Digit Recognition from Historical Document Images
نویسندگان
چکیده
This paper presents a genetic algorithm-based approach that integrates a radial basis function kernel support vector machine applied to pattern recognition. The proposed approach performs feature selection on handwritten digits from historical document images and model design on the adopted support vector machine in order to obtain the best possible recognition performance with the minimum possible feature selection subset. The results generated by this research display that the proposed approach is efficient in finding the parameter set of the support vector machine and the feature selection subset that optimize the pattern distinction into their respective classes. Additionally, this method produces solutions that achieve rate of correct digit classification of 95.10% on average and obtain a pattern representation with approximately 50% reduction in dimensionality regarding the original representation. This paper contributes to the preservation and diffusion of historical documents. Our goal is to recognize handwritten dates in documents for indexing purposes.
منابع مشابه
Persian Handwritten Digit Recognition Using Particle Swarm Probabilistic Neural Network
Handwritten digit recognition can be categorized as a classification problem. Probabilistic Neural Network (PNN) is one of the most effective and useful classifiers, which works based on Bayesian rule. In this paper, in order to recognize Persian (Farsi) handwritten digit recognition, a combination of intelligent clustering method and PNN has been utilized. Hoda database, which includes 80000 P...
متن کاملAmbiguity Reduction through Optimal Set of Region Selection Using GA and BFO for Handwritten Bangla Character Recognition
To recognize different patterns, identification of local regions where the pattern classes differ significantly is an inherent ability of the human cognitive system. This inherent ability of human beings may be imitated in any pattern recognition system by incorporating the ability of locating the regions that contain the maximum discriminating information among the pattern classes. In this cha...
متن کاملNeural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten
Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...
متن کاملSequential and Mixed Genetic Algorithm and Learning Automata (SGALA, MGALA) for Feature Selection in QSAR
Feature selection is of great importance in Quantitative Structure-Activity Relationship (QSAR) analysis. This problem has been solved using some meta-heuristic algorithms such as: GA, PSO, ACO, SA and so on. In this work two novel hybrid meta-heuristic algorithms i.e. Sequential GA and LA (SGALA) and Mixed GA and LA (MGALA), which are based on Genetic algorithm and learning automata for QSAR f...
متن کاملSequential and Mixed Genetic Algorithm and Learning Automata (SGALA, MGALA) for Feature Selection in QSAR
Feature selection is of great importance in Quantitative Structure-Activity Relationship (QSAR) analysis. This problem has been solved using some meta-heuristic algorithms such as: GA, PSO, ACO, SA and so on. In this work two novel hybrid meta-heuristic algorithms i.e. Sequential GA and LA (SGALA) and Mixed GA and LA (MGALA), which are based on Genetic algorithm and learning automata for QSAR f...
متن کامل