A Comparison of Techniques for Automatic Clustering of Handwritten Characters
نویسندگان
چکیده
This work reports experiments with four hierarchical clustering algorithms and two clustering indices for online handwritten characters. The main motivation of the work is to develop an automatic method for finding a set of prototypical characters which would represent well the different writing styles present in a large international database. One of the major obstacles in achieving this goal is the uneven representation of different writing styles in the database. On the basis of the results of the experiments, we claim that a good set of prototypes can be formed from the combined results of the different clustering algorithms. However, the number of clusters cannot be determined automatically but some human intervention is required.
منابع مشابه
Experiments with a Self-supervised Adaptive Classification Strategy in On-line Recognition of Isolated Handwritten Latin Characters
Results on a comparison of recognition techniques for on-line recognition of handwritten Latin alphabets are presented. The emphasis is on an adaptive classii-cation strategy introduced in this paper. The classiication strategy is based on compressing or distilling a large database of handwritten characters to a small set of character prototypes. The distillation is performed as a clustering pr...
متن کاملPersian Handwritten Digit Recognition Using Particle Swarm Probabilistic Neural Network
Handwritten digit recognition can be categorized as a classification problem. Probabilistic Neural Network (PNN) is one of the most effective and useful classifiers, which works based on Bayesian rule. In this paper, in order to recognize Persian (Farsi) handwritten digit recognition, a combination of intelligent clustering method and PNN has been utilized. Hoda database, which includes 80000 P...
متن کاملOff-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملImplementation of Feed-forward Neural Network Models for Pattern Classification Using Transformation Based Feature Extraction Methods
Automatic recognition of handwritten Hindi characters is a difficult and one of the most interesting research areas of pattern recognition field. A lot of work has been done in this area till date; still it is a subject of active research. Hindi characters are cursive in nature and thus characters may be written in various cursive ways. Characters also show a lot of similar features such as hea...
متن کاملOnline Recognition of Handwritten Korean and English Characters
In this study, an improved HMM based recognition model is proposed for online English and Korean handwritten characters. The pattern elements of the handwriting model are sub character strokes and ligatures. To deal with the problem of handwriting style variations, a modified Hierarchical Clustering approach is introduced to partition different writing styles into several classes. For each of t...
متن کامل