How Topic Biases Your Results? A Case Study of Sentiment Analysis and Irony Detection in Italian
نویسندگان
چکیده
In this paper we present our approach to automatically identify the subjectivity, polarity and irony of Italian Tweets. Our system which reaches and outperforms the state of the art in Italian is well adapted for different domains since it uses abstract word features instead of bag of words. We also present experiments carried out to study how Italian Sentiment Analysis systems react to domain changes. We show that bag of words approaches commonly used in Sentiment Analysis do not adapt well to domain changes.
منابع مشابه
A Logistic Regression Model of Irony Detection in Chinese Internet Texts
The research of sentiment analysis has become fascinating with the support of emerging Internet language material. In this paper, irony in Chinese is investigated as a sentiment that has not been meticulously studied. We describe here a set of features and their computational formalization for detecting irony at a linguistic level. Comments from online forum are collected and detected whether i...
متن کاملOverview of the Evalita 2016 SENTIment POLarity Classification Task
English. The SENTIment POLarity Classification Task 2016 (SENTIPOLC), is a rerun of the shared task on sentiment classification at the message level on Italian tweets proposed for the first time in 2014 for the Evalita evaluation campaign. It includes three subtasks: subjectivity classification, polarity classification, and irony detection. In 2016 SENTIPOLC has been again the most participated...
متن کاملAnnotating Sentiment and Irony in the Online Italian Political Debate on #labuonascuola
In this paper we present the TWitterBuonaScuola corpus (TW-BS), a novel Italian linguistic resource for Sentiment Analysis, developed with the main aim of analyzing the online debate on the controversial Italian political reform “Buona Scuola” (Good school), aimed at reorganizing the national educational and training systems. We describe the methodologies applied in the collection and annotatio...
متن کاملIrony Detection: from the Twittersphere to the News Space
English. Automatic detection of irony is one of the hot topics for sentiment analysis, as it changes the polarity of text. Most of the work has been focused on the detection of figurative language in Twitter data due to relative ease of obtaining annotated data, thanks to the use of hashtags to signal irony. However, irony is present generally in natural language conversations and in particular...
متن کاملIronic Gestures and Tones in Twitter
English. Automatic irony detection is a young field of research related to Sentiment Analysis. When dealing with social media data, the shortness of text and the extraction of the statement from his context usually makes it hard to understand irony even for humans but especially for machines. In this paper we propose an analysis of the role that textual information plays in the perception and c...
متن کامل