SVM Approach to Forum and Comment Moderation
نویسنده
چکیده
Social networks, blogs, and forums bring users together to build a community usually based on a system of communication through comments. Based on the degree of complexity, these systems may or may not have a moderator whose primary purpose is to remove spam or abusive comments within these systems. When a moderator is used, it is most often a human, whose time and energy must be exerted to read each and every comment making it a tedious job for a large website. A support vector machine (SVM) approach is proposed for comment moderation. Using a training corpus obtained from the popular website Youtube.com, a support vector machine is used to classify comments as abusive or not. Baseline accuracy is found by performing 10-fold cross validation on unprocessed data. Different experiments are performed on the data by preprocessing it to find if certain variations provide a more accurate estimate.
منابع مشابه
Automatic Moderation of Comments in a Large On-line Journalistic Environment
On-line journalistic sites publish several news and stories every day. Readers of these sites may comment a story, and, as a consequence, a single story might receive thousands of comments. The quality of these comments may vary a lot, from spams and trolls to truly useful information. Separating good from bad comments is an important task, and is the primary goal of comment moderation. Moderat...
متن کاملKeyGraph for Visualization of Discussions in Comments of a Blog Entry with Comment Scores
This paper discusses a new application of KeyGraph for visualization of discussions in comments of a blog entry in Slashdot. KeyGraph is a visualization tool for discovery of relations among text-based data. A common approach of applying KeyGraph is that of applying it to the whole data at once. In this paper, we propose an approach that applies KeyGraph successively to multiple chunks of comme...
متن کاملHow a moderated online discussion forum facilitates support for young people with eating disorders
INTRODUCTION Young people with eating disorders are at risk of harm to their social, emotional and physical development and life chances. Although they can be reluctant to seek help, they may access social media for information, advice or support. The relationship between social media and youth well-being is an emotive subject, but not clearly understood. This qualitative study aimed to explore...
متن کاملExtracting Chatbot Knowledge from Online Discussion Forums
This paper presents a novel approach for extracting high-quality pairs as chat knowledge from online discussion forums so as to efficiently support the construction of a chatbot for a certain domain. Given a forum, the high-quality pairs are extracted using a cascaded framework. First, the replies logically relevant to the thread title of the root mes...
متن کاملLearning to Perform Moderation in Online Forums
Online discussion forums are a valuable resource for people looking to find information, discuss ideas, and get advice on the Internet. Unfortunately, many forums have too much activity and information available, resulting in information overload. Moderation systems are implemented in some forums as a way to handle this problem, but due to sparsity issues, they are often not sufficient. In this...
متن کامل