Phrase Based Direct Model for Improving Handwriting Recognition Accuracies
نویسندگان
چکیده
We propose a method for increasing word recognition accuracies by correcting the output of a handwriting recognition system. We treat the handwriting recognizer as a black-box, such that there is no access to its internals. This enables us to keep our algorithm general and independent of any particular system. We use a novel method for correcting the output based on a direct “phrase-based” system in contrast to traditional sourcechannel models. We report the accuracies of an in-house handwritten word recognizer before and after the correction. We achieve highly encouraging results for a large
منابع مشابه
Influence of Word Length on Handwriting Recognition
Two strategies can be considered in handwriting recognition: phrase or word approaches. In this paper we want to demonstrate the superiority of the phrase one, especially in city name recognition. The performances of an HMM-based off-line system using an analytic approach with explicit segmentation are evaluated on 2 databases: (i) city names in full and (ii) city names in single words. A diffe...
متن کاملOff-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملAn XML Representation for Annotated Handwriting Datasets for Online Handwriting Recognition
In this paper, we briefly descibe an XML representation for annotation of online handwriting data to support the development and evaluation of handwriting recognition algorithms, that is based on the emerging Digital Ink Markup Language (InkML) draft standard from W3C. In particular, we describe how the XML representation we have defined attempts to address issues of (i) support for different s...
متن کاملA New Strategy for Improving Feature Sets in a Discrete Hmm-based Handwriting Recognition System
In this paper we introduce a new strategy for improving a discrete HMM-based handwriting recognition system, by integrating several information sources from specialized feature sets. For a given system, the basic idea is to keep the most discriminative features, and to replace the others with new ones obtained from new feature spaces. After evaluating the individual discriminative power of each...
متن کاملImproved Modeling in Handwriting Recognition
In this work a script independent handwriting recognition system is proposed which is derived from the RWTH-ASR hidden Markov model (HMM) based speech recognizer. Most problems occurring in handwriting recognition (HWR) are induced by large variations within the written text. In particular, different handwriting styles such as cursive writing or long drawn-out strokes are difficult to model. Co...
متن کامل