Building a Cross-document Event-Event Relation Corpus

نویسندگان

  • Yu Hong
  • Tongtao Zhang
  • Tim O'Gorman
  • Sharone Horowit-Hendler
  • Heng Ji
  • Martha Palmer
چکیده

We propose a new task of extracting eventevent relations across documents. We present our efforts at designing an annotation schema and building a corpus for this task. Our schema includes five main types of relations: Inheritance, Expansion, Contingency, Comparison and Temporality, along with 21 subtypes. We also lay out the main challenges based on detailed inter-annotator disagreement and error analysis. We hope these resources can serve as a benchmark to encourage research on this new problem.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

GPLSIUA: Combining Temporal Information and Topic Modeling for Cross-Document Event Ordering

Building unified timelines from a collection of written news articles requires cross-document event coreference resolution and temporal relation extraction. In this paper we present an approach event coreference resolution according to: a) similar temporal information, and b) similar semantic arguments. Temporal information is detected using an automatic temporal information system (TIPSem), wh...

متن کامل

MEANTIME, the NewsReader Multilingual Event and Time Corpus

In this paper, we present the NewsReader MEANTIME corpus, a semantically annotated corpus of Wikinews articles. The corpus consists of 480 news articles, i.e. 120 English news articles and their translations in Spanish, Italian, and Dutch. MEANTIME contains annotations at different levels. The document-level annotation includes markables (e.g. entity mentions, event mentions, time expressions, ...

متن کامل

Building a Gold Standard for Event Detection in Croatian

This paper describes the process of building a newspaper corpus annotated with events described in specific documents. The main difference to the corpora built as part of the TDT initiative is that documents are not annotated by topics, but by specific events they describe. Additionally, documents are gathered from sixteen sources and all documents in the corpus are annotated with the correspon...

متن کامل

Ha, Eun Young. Modeling Discourse Structure and Temporal Event Relations for Automated Document Summarization with Markov Logic Networks. (under the Direction of Modeling Discourse Structure and Temporal Event Relations for Automated Document Summarization with Markov Logic Networks

HA, EUN YOUNG. Modeling Discourse Structure and Temporal Event Relations for Automated Document Summarization with Markov Logic Networks. (Under the direction of James C. Lester.) Recent years have seen significant progress in natural language processing. A key challenge posed by many natural language applications ranging from text summarization to question answering and machine translation is ...

متن کامل

Building Patterns for Biomedical Event Extraction

Generally, Event Extraction is to identify any instance of a particular class of events in a natural language text, to extract the relevant arguments of the event, and to represent the extracted information into a structured form.1Let us define Event on the binary relation between two entities for special event verbs which are predefined by biologists. Here, Entity means biomedical entities suc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016