نتایج جستجو برای: arabic e text
تعداد نتایج: 1252730 فیلتر نتایج به سال:
Text preprocessing is an essential stage in text categorization (TC) particularly and text mining generally. Morphological tools can be used in text preprocessing to reduce multiple forms of the word to one form. There has been a debate among researchers about the benefits of using morphological tools in TC. Studies in the English language illustrated that performing stemming during the preproc...
To date, there are no fully automated systems addressing the community’s need for fundamental language processing tools for Arabic text. In this paper, we present a Support Vector Machine (SVM) based approach to automatically tokenize (segmenting off clitics), part-ofspeech (POS) tag and annotate base phrases (BPs) in Arabic text. We adapt highly accurate tools that have been developed for Engl...
Text summarization based on rhetorical structure theory has shown extremely interesting result. The process of extracting the text summary from the result of the rhetorical parser is not a singleton. Different rhetorical structure trees are generated from one text. Unfortunately, the result of the generated summary is not equivalent for those trees, and the correctness of the result is affected...
Arabizi is Arabic text that is written using Latin characters. Arabizi is used to present both Modern Standard Arabic (MSA) or Arabic dialects. It is commonly used in informal settings such as social networking sites and is often with mixed with English. In this paper we address the problems of: identifying Arabizi in text and converting it to Arabic characters. We used word and sequence-level ...
This paper is a quick review of some of the scholarly work aiming at solving various problems of the Arabic language using neural networks. It includes some research work concerning online recognition of handwritten Arabic characters, speech recognition, offline character text recognition, text categorization and recognition of printed text. This paper concludes that more research should be con...
In this paper we present a survey of the literature on Arabic writer identification scheme and up-to date techniques employed in identification. The paper begins with an overview of the various writer identification schemes in Arabic and Persian languages. After that, an attempt is made to describe the complex character of Arabic strokes. Previous studies have used a number of Arabic datasets c...
The importance of building sentiment analysis tools for Arabic social media has been recognized during the past couple of years, especially with the rapid increase in the number of Arabic social media users. One of the main difficulties in tackling this problem is that text within social media is mostly colloquial, with many dialects being used within social media platforms. In this paper, we p...
The goal of this paper is to present an overview about the thinning problem in Arabic text recognition. Thinning "Skeletonization" is a very crucial stage in the ACR, it simplifies the text shape and reduces the amount of data that needs to be handled and it is usually used as a pre-processing stage for recognition and storage systems. The skeleton of Arabic text can be used for each ...
In this paper we present a system for document understanding and for recognition of printed Arabic text. Arabic characters must be segmented before recognition. We overcome the problem of segmentation by our proposed ORAN system (Offline Recognition of Arabic characters and Numerals). ORAN is based on a method called Modified MCR. Using a stroke index, we can parse compound document images into...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید