Exploiting News to Categorize Tweets: Quantifying the Impact of Different News Collections

نویسندگان

  • Marco Pavan
  • Stefano Mizzaro
  • Matteo Bernardon
  • Ivan Scagnetto
چکیده

Short texts, due to their nature which makes them full of abbreviations and new coined acronyms, are not easy to classify. Text enrichment is emerging in the literature as a potentially useful tool. This paper is a part of a longer term research that aims at understanding the effectiveness of tweet enrichment by means of news, instead of the whole web as a knowledge source. Since the choice of a news collection may contribute to produce very different outcomes in the enrichment process, we compare the impact of three features of such collections: volume, variety, and freshness. We show that all three features have a significant impact on categorization accuracy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling the Impact of News on volatility The Case of Iran

In this paper various ARCH models and relevant news impact curves including a partially nonparametric (PNP) one are compared and estimated with daily Iran stock return data. Diagnostic tests imply the asymmetry of the volatility response to news. The EGARCH model, which passes all the tests and appears relatively matching with the asymmetry in the data, seems to be the most adequate characteriz...

متن کامل

News-Topic Oriented Hashtag Recommendation in Twitter Based on Characteristic Co-occurrence Word Detection

Hashtags, which started to be widely used since 2007, are always utilized to mark keywords in tweets to categorize messages and form conversation for topics in Twitter. However, it is hard for users to use hashtags for sharing their opinions/interests/comments for their interesting topics. In this paper, we present a new approach for recommending news-topic oriented hashtags to help Twitter use...

متن کامل

Tweet-Recommender: Finding Relevant Tweets for News Articles

Twitter has become a prime source for disseminating news and opinions. However, the length of tweets prohibits detailed descriptions; instead, tweets sometimes contain URLs that link to detailed news articles. In this paper, we devise generic techniques for recommending tweets for any given news article. To evaluate and compare the different techniques, we collected tens of thousands of tweets ...

متن کامل

A Study on News Anchors’ Meta-Language and Non-Verbal Factors and their Impact on Audiences

Non-verbal communication or body messaging occurs when facial expressions, tone of voice, head and neck movements, smiling and ... affects others; which may be intentional or unintentional. Farhangi in nonverbal communication: the art of using movement and sound” defines this field as such: "Non-verbal communication is phonetic and non-phonetic messages which have been explained by other than l...

متن کامل

Distant Supervision for Topic Classification of Tweets in Curated Streams

We tackle the challenge of topic classi€cation of tweets in the context of analyzing a large collection of curated streams by news outlets and other organizations to deliver relevant content to users. Our approach is novel in applying distant supervision based on semi-automatically identifying curated streams that are topically focused (for example, on politics, entertainment, or sports). Œese ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016