A Comparative Study on Linguistic Feature Selection in Sentiment Polarity Classification
نویسنده
چکیده
Sentiment polarity classification is perhaps the most widely studied topic. It classifies an opinionated document as expressing a positive or negative opinion. In this paper, using movie review dataset, we perform a comparative study with different single kind linguistic features and the combinations of these features. We find that the classic topicbased classifier(Naive Bayes and Support Vector Machine) do not perform as well on sentiment polarity classification. And we find that with some combination of different linguistic features, the classification accuracy can be boosted a lot. We give some reasonable explanations about these boosting outcomes.
منابع مشابه
Sentiment Classification and Polarity Shifting
Polarity shifting marked by various linguistic structures has been a challenge to automatic sentiment classification. In this paper, we propose a machine learning approach to incorporate polarity shifting information into a document-level sentiment classification system. First, a feature selection method is adopted to automatically generate the training data for a binary classifier on polarity ...
متن کاملGermanPolarityClues: A Lexical Resource for German Sentiment Analysis
In this paper, we propose GermanPolarityClues, a new publicly available lexical resource for sentiment analysis for the German language. While sentiment analysis and polarity classification has been extensively studied at different document levels (e.g. sentences and phrases), only a few approaches explored the effect of a polarity-based feature selection and subjectivity resources for the Germ...
متن کاملAn Evaluation of Sentiment Analysis and Classification Algorithms for Arabic Textual Data
Sentiment analysis is a recent advance in text mining applications for analyzing textual data according to orientation of human comments to determine whether they are positive, negative, or neutral. Different data mining techniques and algorithms such as support vector machine, naïve Bayes, decision tree, k-nearest neighbor and other techniques are used for analyzing textual data. These techniq...
متن کاملImproving Document-Level Sentiment Classification Using Contextual Valence Shifters
Traditional sentiment feature extraction methods in documentlevel sentiment classification either count the frequencies of sentiment words as features, or the frequencies of modified and unmodified instances of each of these words. However, these methods do not represent the sentiment words’ linguistic context efficiently. We propose a novel method and feature set to handle the contextual polar...
متن کاملFinding Domain Specific Polar Words for Sentiment Classification
This paper presents a method of using conditional random fields (CRF) for extracting polar words and determining the overall sentiment of text. We frame sentiment classification as a feature selection problem and conduct three sets of experiments by using: prior polarity lexicons, bag-of-words classifiers and CRF sequence models. The results show the potential of utilizing CRFs in discovering h...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1311.0833 شماره
صفحات -
تاریخ انتشار 2013