Using Fuzzifiers to Solve Word Sense Ambiguation in Arabic Language
نویسنده
چکیده
Text mining techniques confront many challenges when dealing with the Arabic language including lexical disambiguation because Arabic is a highly inflectional and derivational language, most of the Arabic texts are devoid of diacritics especially Modern Standard Arabic (MSA), thus, it is a must to depend on the ambiguous word context under study. Two fuzzy logic classifiers have been built and compared to a supervised corpusbased Naïve Bayes classifier. The study concludes that the results that have been obtained from our fuzzy logic classifiers are more accurate and promising.
منابع مشابه
Hiding a Semantic Class Hierarchy in a Markov Model
This paper introduces a new model of selectional preference induction. Unlike previous approaches we provide a stochastic generation model for the words that appear as arguments of a predicate. More speciically, we deene a hidden Markov model with the general shape of our semantic class hierarchy and we use an EM-like algorithm for parameter estimation. This model enables us to handle word sens...
متن کاملNaïve Bayes Classifier for Arabic Word Sense Disambiguation
Word Sense Disambiguation (WSD) is the process of selecting a sense of an ambiguous word in a given context from a set of predefined senses. Sense Inventory usually comes from a dictionary or thesaurus. In Arabic, the main cause of word ambiguity is the lack of diacritics of the most digital documents so the same word can occurs with different senses. In this paper, we use the rooting algorithm...
متن کاملBuilding A Training Corpus For Word Sense Disambiguation In English-To-Vietnamese Machine Translation
The most difficult task in machine translation is the elimination of ambiguity in human languages. A certain word in English as well as Vietnamese often has different meanings which depend on their syntactical position in the sentence and the actual context. In order to solve this ambiguation, formerly, people used to resort to many hand-coded rules. Nevertheless, manually building these rules ...
متن کاملIs Hesaabdaaree an Adequate Equivalent for Accounting?
Some of the difficulties and misunderstandings that happen in accounting theory, practice, regulation and education are grounded in language and linguistics. As an illustration of this the Persian equivalent of 'accounting' is linguistically analysed to reveal how a mistranslation may cause difficulties in understanding and improving Iranian Accounting. This paper shows how Hesaabdaaree is not ...
متن کاملThe Use of Word Sense Disambiguation in an Information Extraction System
This paper describes a rule-based methodology for word sense disambiguation and an application of the methodology to information extraction using rules generalized with the help of the WordNet system. The methodology creates word sense dis-ambiguation rules based on user trained examples working in the domain of interest. It achieves accuracy rates comparable to the best competing methods and c...
متن کامل