using fuzzy lr numbers in bayesian text classifier for classifying persian text documents

نویسندگان

چکیده

text classification is an important research field in information retrieval and text mining. the main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. since word detection is a difficult and time consuming task in persian language, bayesian text classifier is an appropriate approach to deal with different word formats and new words. also, fuzzy theory may be used to manage uncertainty in imprecise persian sentences. in this paper, we utilize l-r type fuzzy numbers in bayesian text classifier to classify textual persian documents (fuzzy bayesian text classifier). the obtained results on simulated imprecise textual persian documents show improvements in both recall and precision parameters by using fuzzy bayesian text classification approach over naïve bayesian text classifier

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents

Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...

متن کامل

Using Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents

متن کامل

A New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier

With the fast increase of the documents, using Text Document Classification (TDC) methods has become a crucial matter. This paper presented a hybrid model of Invasive Weed Optimization (IWO) and Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS) in order to reduce the big size of features space in TDC. TDC includes different actions such as text processing, feature extraction, form...

متن کامل

Classifying Text Documents by Associating Terms With Text Categories

متن کامل

EXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS

Due to the explosive growth of the world-wide web, automatictext summarization has become an essential tool for web users. In this paperwe present a novel approach for creating text summaries. Using fuzzy logicand word-net, our model extracts the most relevant sentences from an originaldocument. The approach utilizes fuzzy measures and inference on theextracted textual information from the docu...

متن کامل

Fuzzy Feature Clustering for Text Classification Using Sequence Classifier

Due to the rapid growth of Internet Technologies and the prosper of WWW, the volume of textual data is increasing more and more, thereby leading to the significance of text classification. Feature Clustering is a powerful method to reduce the dimensionality of feature vector for text classification. Text Classification is one of the important research issues in the field of text mining, where t...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ذخیره در منابع من

عنوان ژورنال:

international journal of information, security and systems management

جلد ۲، شماره ۱، صفحات ۱۱۸-۱۲۳

کلمات کلیدی

text classification fuzzy l r numbers bayesian classification

میزبانی شده توسط پلتفرم ابری doprax.com