Online Adspace Posts' Category Classification
نویسندگان
چکیده
Online adspaces require the seller/buyer to select a category to post their advertisements. This practice not only causes hindrance to legitimate users while posting their advertisements, but, also deter the user experience as they will see a lot of non-categorized ads due to human error or spam. Classifying advertisements posted on an online adspace can help in spam detection, better information propagation which in turn enhances user experience. Craigslist is a prevalent platform for local classified advertisements. In this paper, we present a classification system for an online advertisement space such as Craigslist. We show the performance of our algorithm with three standard classifiers, viz., Support Vector Machine, Random Forest, and Multinomial Naive Bayes. An accuracy of 84% was achieved with the SVM.
منابع مشابه
Automated Speech Act Classification For Online Chat
In this paper, we present our investigation on using supervised machine learning methods to automatically classify online chat posts into speech act categories, which are semantic categories indicating speakers’ intentions. Supervised machine learning methods presuppose the existence of annotated training data based on which machine learning algorithms can be used to learn the parameters of som...
متن کاملExploring Societal Risk Classification of the Posts of Tianya Club
To identify the societal risk category of the posts of Tianya Club, several studies are carried out toward the posts of Tianya Club. With 2-month manually risk labeled new posts published during December of 2011 to January of 2012, statistical analysis of posts is conducted at first. Later, similarity analysis of posts from one risk category, different risk categories and published on different...
متن کاملCollective Stance Classification of Posts in Online Debate Forums
Online debate sites are a large source of informal and opinion-sharing dialogue on current socio-political issues. Inferring users’ stance (PRO or CON) towards discussion topics in domains such as politics or news is an important problem, and is of utility to researchers, government organizations, and companies. Predicting users’ stance supports identification of social and political groups, bu...
متن کاملRecognition of Sentiment Sequences in Online Discussions
Currently 19%-28% of Internet users participate in online health discussions. In this work, we study sentiments expressed on online medical forums. As well as considering the predominant sentiments expressed in individual posts, we analyze sequences of sentiments in online discussions. Individual posts are classified into one of the five categories encouragement, gratitude, confusion, facts, an...
متن کاملMining Sentiment Classification from Political Web Logs
Over the last few years the number of web logs and amount of opinionated data on the World Wide Web has grown dramatically. Web logs allow people to share their opinions on a wide range of “hot” topics with virtual communities. As readers start turning to web logs as a source of information, automatic techniques that identify the sentiment of web log posts will help bloggers categorize and filt...
متن کامل