A Harmony Search Algorithm for Recognition-Based Segmentation of Online Arabic Text

Authors

  • Moayad Yousif Potrus
  • Umi Kalthum Ngah
Abstract

In this paper, a Harmony Search (HS) algorithm is used for online Arabic text recognition. The algorithm has two phases: text segmentation using dominant point detection, and character recognition using HS. The segmentation phase uses dominant point detection to mark a minimal number of points that can form the text skeleton. The generated skeleton is then expressed as a directional model with 6 directions, which minimizes the directions opposite to the writing direction. As a result, the new directional expression of the text exploits all possible segmentation points. Finally, HS matches the best database character to each target character produced by the segmentation process, by minimizing the total score obtained from the overall text matching. The system is tested on a database of 4500 words comprising 21234 characters in different positions or forms (isolated, start, middle, and end). The data set is divided into 3000 words for training and 1500 words for testing. The algorithm achieved a 93.4% word recognition rate with an execution time of 4.3 sec.
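
The matching step described in the abstract can be illustrated with a generic discrete Harmony Search. The sketch below is not the authors' implementation: it assumes a hypothetical precomputed cost matrix `cost[i][t]` (the matching score of segment `i` against database template `t`) and uses the standard HS parameters, here named `hms` (harmony memory size), `hmcr` (harmony memory considering rate), and `par` (pitch adjusting rate). The pitch adjustment, which moves to a neighboring template index, is likewise an illustrative assumption.

```python
import random


def harmony_search(cost, n_templates, hms=10, hmcr=0.9, par=0.3,
                   iterations=2000, seed=0):
    """Pick one template index per segment so the total cost is minimal.

    cost[i][t] is an assumed, precomputed matching score between
    segment i and database template t (lower is better).
    """
    rng = random.Random(seed)
    n_segments = len(cost)

    def total(harmony):
        # Objective: sum of per-segment matching scores.
        return sum(cost[i][t] for i, t in enumerate(harmony))

    # Harmony memory: a pool of random candidate assignments.
    memory = [[rng.randrange(n_templates) for _ in range(n_segments)]
              for _ in range(hms)]
    scores = [total(h) for h in memory]

    for _ in range(iterations):
        new = []
        for i in range(n_segments):
            if rng.random() < hmcr:
                # Memory consideration: reuse a value stored in memory.
                value = memory[rng.randrange(hms)][i]
                if rng.random() < par:
                    # Pitch adjustment: step to a neighboring index.
                    value = (value + rng.choice((-1, 1))) % n_templates
            else:
                # Random selection from the whole template range.
                value = rng.randrange(n_templates)
            new.append(value)

        # Replace the worst stored harmony if the new one is better.
        new_score = total(new)
        worst = max(range(hms), key=scores.__getitem__)
        if new_score < scores[worst]:
            memory[worst] = new
            scores[worst] = new_score

    best = min(range(hms), key=scores.__getitem__)
    return memory[best], scores[best]


# Usage with a made-up 3-segment, 4-template cost matrix.
example_cost = [
    [5.0, 1.0, 4.0, 3.0],
    [2.0, 6.0, 0.5, 4.0],
    [3.0, 3.5, 2.0, 0.2],
]
assignment, score = harmony_search(example_cost, n_templates=4)
print(assignment, score)
```

In this toy form each segment's cost is independent, so a per-row minimum would solve it directly; the search only becomes worthwhile when the score couples segmentation and recognition choices across the whole word, as the abstract's "overall text matching" suggests.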


Similar resources

Document Analysis And Classification Based On Passing Window

In this paper we present a document analysis and classification system to segment and classify the contents of Arabic document images. The system includes preprocessing, document segmentation, feature extraction, and document classification. In preprocessing, a document image is enhanced by removing noise, binarizing, and detecting and correcting image skew. In document segmentation, an algorith...


Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

To facilitate the entry of data into the computer and its digitization, automatic recognition of printed texts and manuscripts is a considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...


Text Summarization Using Cuckoo Search Optimization Algorithm

Today, with the rapid growth of the World Wide Web and the creation of Internet sites and online text resources, text summarization has attracted considerable attention from researchers. Extractive text summarization is an important summarization method that consists of selecting the most representative sentences from the input document. When facing documents with a large volume of data, the extr...


Multi-Stage Fuzzy Load Frequency Control Based on Multi-objective Harmony Search Algorithm in Deregulated Environment

A new Multi-Stage Fuzzy (MSF) controller based on a Multi-objective Harmony Search Algorithm (MOHSA) is proposed in this paper to solve the Load Frequency Control (LFC) problem of power systems in a deregulated environment. LFC problems are caused by load perturbations, which continuously disturb the normal operation of the power system. The objectives of LFC are to minimize the transient deviati...


An Adaptive Algorithm for the Automatic Segmentation of Printed Arabic Text

Character segmentation is a crucial step in most Arabic optical text recognition systems. The recognition process depends mainly on the accuracy of the character segmentation. This paper presents a novel adaptive algorithm for the off-line segmentation of printed Arabic text. Arabic writing has many challenging features; for example, it is cursive and characters in a word can take ...



Publication date: 2012