A Study on Microblog and Search Engine User Behaviors: How Twitter Trending Topics Help Predict Google Hot Queries
نویسندگان
چکیده
Once every five minutes, Twitter publishes a list of trending topics by monitoring and analyzing tweets from its users. Similarly, Google makes available hourly a list of hot queries that have been issued to the search engine. We claim that social trends fired by Twitter may help explain and predict web trends derived from Google. Indeed, we argue that information flooding nearly real-time across the Twitter social network could anticipate the set of topics that users will later search on the Web. In this work, we analyze the time series derived from the daily volume index of each trend, either by Twitter or Google. Our study on a real-world dataset reveals that about 26% of the trending topics raising from Twitter “asis” are also found as hot queries issued to Google. Also, we find that about 72% of the similar trends appear first on Twitter. Thus, we assess the relation between comparable Twitter and Google trends by testing three classes of time series regression models. First, we find that Google by its own is not able to effectively predict the time behavior of its trends. Indeed, we show that autoregressive models, which try to fit time series of Google trends, perform poorly. On the other hand, we validate the forecasting power of Twitter by showing that models, which use Google as the dependent variable and Twitter as the explanatory variable, retain as significant the past values of Twitter 60% of times. Moreover, we discover that a Twitter trend causes a similar Google trend to later occur about 43% of times. In the end, we show that the very best-performing models are those using past values of both Twitter and Google.
منابع مشابه
Summarization of Twitter Microblogs
Twitter1, the microblog site started in 2006, has become a social phenomenon. More than 340 million Tweets are sent out every day2. While a majority of posts are conversational or not particularly meaningful, about 3.6% of the posts concern topics of mainstream news3. Twitter has been credited with providing the most current news about many important events before traditional media, such as the...
متن کاملTowards Supporting Search over Trending Events with Social Media
Many search engines identify bursts of activity around particular topics and reflect these back to users as Popular Now or Hot Searches. Activity around these topics typically evolves quickly in real-time during the course of a trending event. Users’ informational needs when searching for such topics will vary depending on the stage at which they engage with an event. Through a survey and log s...
متن کاملSuggestions for Fresh Search Queries by Mining Mircoblog Topics
Query suggestion of Web search has been an effective approach to help users quickly express their information need and more accurately get the information they need. All major web-search engines and most proposed methods that suggest queries rely on query logs of search engine to determine possible query suggestions. However, for search systems, it is much more difficult to effectively suggest ...
متن کاملSemantic Annotation for Microblog Topics Using Wikipedia Temporal Information
Trending topics in microblogs such as Twitter are valuable resources to understand social aspects of real-world events. To enable deep analyses of such trends, semantic annotation is an effective approach; yet the problem of annotating microblog trending topics is largely unexplored by the research community. In this work, we tackle the problem of mapping trending Twitter topics to entities fro...
متن کاملQuery Expansion for Microblog Retrieval
Entries in microblogging sites such as Twitter are very short: a “tweet ”can contain at most 140 characters. Given a user query, retrieving relevant tweets is particularly challenging since their extreme brevity exacerbates the well-known vocabulary mismatch problem. In this preliminary study, we explore standard query expansion approaches as a way to address this problem. Since the tweets are ...
متن کامل