NEAT: News Exploration Along Time

نویسندگان

  • Omar Alonso
  • Klaus Berberich
  • Srikanta J. Bedathur
  • Gerhard Weikum
چکیده

There are a number of efforts towards building applications that leverage temporal information in documents. The demonstration of our NEAT (News Exploration Along Time) prototype system that we propose here, is an attempt towards building an intuitive and exploratory interface for search results over large news archives using timelines. The demonstration uses the New York Times Annotated Corpus as an illustrative example of such a news archive. The NEAT system consists of two parts: the back-end server extracts and stores in an index all the temporal information from documents, and performs important phrase discovery from sentences that have timesensitive information. The front-end user interface, anchors the results of a keyword search along the timeline where the user can explore and browse results at different points in time. To aid in this exploration, the interesting phrases discovered from the result documents are displayed on the timeline to provide an overview. Another key feature of NEAT, which distinguishes it from other timeline-based approaches, is the adoption of semantic temporal annotations to anchor results on the timeline. An appropriate choice of personally-identifiable temporal annotations can enable users to more effectively contextualize results. For example, Barack Obama was elected in 2008 and Germany hosted the FIFA World Cup in 2006. We gathered temporal annotations at large-scale by crowdsourcing it over Amazon Mechanical Turk (AMT). Each HIT (Human Intelligence Task) on AMT consists of a request to expand a temporal expression (such as a year, a time-interval, or decade, etc.) with an entity (e.g., a person, country, organization etc.). Based on the agreement level among workers, we derive key entities for constructing a semantic temporal annotation layer on top the timeline. The outcome is a manually annotated timeline that can be very useful to anchor search results. Examples of annotations produced by crowdsourcing are (1969: Woodstock, Moon landing), (1970: Nixon), and (2003-2009: Iraq war) to name a few with different time granularities. The demonstration consists of an exploratory search interface where we show how queries can produce different timelines and how one can use temporal information to discover interesting facts.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Time-Based Exploration of News Archives

In this paper, we present NEAT, a prototype system that provides an exploration interface to news archive search. Our prototype visualizes search results making use of two kinds of temporal information, namely, news articles’ publication dates but also their contained temporal expressions. The displayed timelines are annotated with major events, harvested using crowdsourcing, to make it easier ...

متن کامل

An Exploration of News Meta-Search Across Multiple Languages

This paper presents aspects involved in executing a crosslanguage meta-search for news. A specific solution, here forth referred to as Global Reporter is described, as are users’ feedback, and information along with fact coverage results from a controlled experiment with the Global Reporter. Where suitable, recommendations for modifying the system are presented as well as trade-offs to alternat...

متن کامل

EXPOSÉ: EXploring Past news fOr Seminal Events

Recent increase in digitalization and archiving efforts on news data have led to overwhelming amount of online information for a general user, thus making it difficult for them to retrospect on past events. One dimension along which past events can be effectively organized is time. Motivated by this idea, we introduce EXPOSÉ, an exploratory search system that explicitly uses temporal informatio...

متن کامل

Applied Visual Exploration on Real-time News Feeds using Polarity and Geo-spatial Analysis

This paper presents a visual analytics approach to explore large news article collections in the domains of polarity and spatial analysis. The exploration is performed on the data collected with Europe Media Monitor (EMM), a system which monitors over 2500 online sources and processes 90,000 articles per day. By analyzing the news feeds, we want to find out which topics are important in differe...

متن کامل

A Flexible Topic-driven Framework for News Exploration

With the flourishing of various Web applications, the Internet has become one of the most important means to access news. According to one investigation, in the population of Internet users, 78.5% are looking for news. Unfortunately, although the Internet provides a platform for easily sharing information, it also brings a fast explosion of the news data. It leads to the fact that people spend ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010