using fuzzy lr numbers in bayesian text classifier for classifying persian text documents
نویسندگان
چکیده
text classification is an important research field in information retrieval and text mining. the main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. since word detection is a difficult and time consuming task in persian language, bayesian text classifier is an appropriate approach to deal with different word formats and new words. also, fuzzy theory may be used to manage uncertainty in imprecise persian sentences. in this paper, we utilize l-r type fuzzy numbers in bayesian text classifier to classify textual persian documents (fuzzy bayesian text classifier). the obtained results on simulated imprecise textual persian documents show improvements in both recall and precision parameters by using fuzzy bayesian text classification approach over naïve bayesian text classifier
منابع مشابه
Using Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents
Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...
متن کاملUsing Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents
Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...
متن کاملA New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier
With the fast increase of the documents, using Text Document Classification (TDC) methods has become a crucial matter. This paper presented a hybrid model of Invasive Weed Optimization (IWO) and Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS) in order to reduce the big size of features space in TDC. TDC includes different actions such as text processing, feature extraction, form...
متن کاملEXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS
Due to the explosive growth of the world-wide web, automatictext summarization has become an essential tool for web users. In this paperwe present a novel approach for creating text summaries. Using fuzzy logicand word-net, our model extracts the most relevant sentences from an originaldocument. The approach utilizes fuzzy measures and inference on theextracted textual information from the docu...
متن کاملFuzzy Feature Clustering for Text Classification Using Sequence Classifier
Due to the rapid growth of Internet Technologies and the prosper of WWW, the volume of textual data is increasing more and more, thereby leading to the significance of text classification. Feature Clustering is a powerful method to reduce the dimensionality of feature vector for text classification. Text Classification is one of the important research issues in the field of text mining, where t...
متن کاملمنابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
international journal of information, security and systems managementجلد ۲، شماره ۱، صفحات ۱۱۸-۱۲۳
کلمات کلیدی
میزبانی شده توسط پلتفرم ابری doprax.com
copyright © 2015-2023