Combining Semantic Comprehension and Machine Learning for Chinese Sentiment Classification

نویسندگان

  • Jianfeng Xu
  • Yuan Xu
  • Yuanjian Zhang
  • Yu Li
چکیده

Semantic comprehension-based and machine learning based are two major methods for the classification of Chinese sentiment. The advantage of semantic comprehension-based method is that it can classify text among domains and achieve satisfied portability. However, the accuracy of classification is limited. Although the accuracy derived from supervised machine learning method is much better, the portability is rather poor due to randomly selection of samples and subjective labeling of semantic orientation. In this paper, a hybrid framework combining the advantages of the two methods was proposed. The text features were extracted preliminary based on semantic comprehension and were optimized by a novel information gain method. The features expressed in vector space model were integrated with traditional machine learning algorithm. Experiments show that support vector machine has the best discriminative power compared to other machine learning algorithms. Additionally, this framework improves portability and accuracy as compared to both semantic comprehension-based methods and machine learning based methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Empirical Study of Sentiment Analysis for Chinese Microblogging

This paper used three machine learning algorithms, three kinds of feature selection methods and three feature weight methods to study the sentiment classification for Chinese microblogging. The experimental results indicate that the performance of SVM is best in three machine learning algorithms; IG is the better feature selection method compared to the other methods, and TF-IDF is best fit for...

متن کامل

Twitter Sentiment Analysis: Lexicon Method, Machine Learning Method and Their Combination

This paper presents a step-by-step methodology for Twitter sentiment analysis. Two approaches are tested to measure variations in the public opinion about retail brands. The first, a lexicon-based method, uses a dictionary of words with assigned to them semantic scores to calculate a final polarity of a tweet, and incorporates part of speech tagging. The second, machine learning approach, tackl...

متن کامل

XRCE at SemEval-2016 Task 5: Feedbacked Ensemble Modeling on Syntactico-Semantic Knowledge for Aspect Based Sentiment Analysis

This paper presents our contribution to the SemEval 2016 task 5: Aspect-Based Sentiment Analysis. We have addressed Subtask 1 for the restaurant domain, in English and French, which implies opinion target expression detection, aspect category and polarity classification. We describe the different components of the system, based on composite models combining sophisticated linguistic features wit...

متن کامل

A High-Performance Model based on Ensembles for Twitter Sentiment Classification

Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people of the world intensity use social networks like Twitter. It supports users to publish tweets to tell what they are thinking about topics. There are numerous web sites built on the Internet presenting Twitter. The user can enter a sentiment ta...

متن کامل

Predicting the Semantic Orientation of Terms in E-HowNet

The semantic orientation of terms is fundamental for sentiment analysis in sentence and document levels. Although some Chinese sentiment dictionaries are available, how to predict the orientation of terms automatically is still important. In this paper, we predict the semantic orientation of terms of E-HowNet. We extract many useful features from different sources to represent a Chinese term in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015