Predicting User Competence from Text
نویسنده
چکیده
We explore the possibility of learning user competence from a text by using natural language processing and machine learning (ML) methods. In our context, competence is defined as the ability to identify the wildlife appearing in images and classifying into species correctly. We evaluate and compare the performance (regarding accuracy and Fmeasure) of the three ML methods, Naive Bayes (NB), Decision Trees (DT) and K-nearest neighbors (KNN), applied to the text corpus obtained from the Snapshot Senrengeti discussion forum posts. The baseline results show, that regarding accuracy, DT outperforms NB and KNN by 16.00%, and 15.00% respectively. Regarding F-measure, K-NN outperforms NB and DT by 12.08% and 1.17%, respectively. We also propose a hybrid model that combines the three models (DT, NB and KNN). We improve the baseline results with the calibration technique and additional features. Adding a bi-gram feature has shown a dramatic increase (from 48.38% to 64.40%) of accuracy for NB model. We achieved to push the accuracy limit in the baseline models from 93.39% to 94.09%.
منابع مشابه
Bandwidth and Delay Optimization by Integrating of Software Trust Estimator with Multi-User Cloud Resource Competence
Trust Establishment is one of the significant resources to enhance the scalability and reliability of resources in the cloud environment. To establish a novel trust model on SaaS (Software as a Service) cloud resources and to optimize the resource utilization of multiple user requests, an integrated software trust estimator with multi-user resource competence (IST-MRC) optimization mechanism is...
متن کاملInvestigate the role of locus of control and social problem-solving competence in predicting students’ achievement motivation
The purpose of the present research is to investigate the role of locus of control and social problem-solving competence in predicting students’ achievement motivation in Karaj, Iran. The present study is of the correlational research type. The research population includes all female and male students in District four in Karaj, from among whom 200 students (100 female students and 100 male stud...
متن کاملThe Value of Perfectionism in Predicting Coping Strategies in Drug-User Women
Background: Positive perfectionism helps the individual to experience fewer worries and less anxiety. The aim of the present study was to assess the value of coping strategies to predict perfectionism in drug-user women. Methods: This cross-sectional study was performed on 361 consecutive drug-user women who were randomly selected from a total of 6237 women referring to the Drug Abuse Centers o...
متن کاملPredicting User Views in Online News
We analyze user viewing behavior on an online news site. We collect data from 64,000 news articles, and use text features to predict frequency of user views. We compare predictiveness of the headline and “teaser” (viewed before clicking) and the body (viewed after clicking). Both are predictive of clicking behavior, with the full article text being most predictive.
متن کاملAuthor gender identification from text using Bayesian Random Forest
Nowadays high usage of users from virtual environments and their connection via social networks like Facebook, Instagram, and Twitter shows the necessity of finding out shared subjects in this environment more than before. There are several applications that benefit from reliable methods for inferring age and gender of users in social media. Such applications exist across a wide area of fields,...
متن کامل