inter rater reliability

Two Layers of Annotation for Representing Event Mentions in News Stories

2017

Maria Pia di Buono Martin Tutek Jan Snajder Goran Glavas Bojana Dalbelo Basic Natasa Milic-Frayling

In this paper, we describe our preliminary study of methods for annotating event mentions as part of our research on highprecision models for event extraction from news. We propose a two-layer annotation scheme, designed to capture the functional and the conceptual aspects of event mentions separately. We hypothesize that the precision can be improved by modeling and extracting the different as...

متن کامل

Specifications and implementation of a new exchange format to support computerized consensus in pathology

Journal: :Studies in health technology and informatics 2004

Eric Zapletal Christel Daniel-Le Bozec Patrice Degoulet Jean-Marc Guinebretière Marie-Christine Jaulent

In the pathology domain, consensus sessions around multi-headed microscopes enhance reproducibility and can reduce inter- and intra-observer variability. Computerized tools and Web technology could facilitate the organization of consensus sessions and assist pathologists to agree on features that are relevant to diagnosis. In the context of the IDEM project, whose aim is to achieve a computeriz...

متن کامل

Annotators' Certainty and Disagreements in Coreference and Bridging Annotation in Prague Dependency Treebank

2013

Anna Nedoluzhko Jirí Mírovský

In this paper, we present the results of the parallel Czech coreference and bridging annotation in the Prague Dependency Treebank 2.0. The annotation is carried out on dependency trees (on the tectogrammatical layer). We describe the inter-annotator agreement measurement, classify and analyse the most common types of annotators’ disagreement. On two selected long texts, we asked the annotators ...

متن کامل

Inter-rater Agreement on Sentence Formality

Journal: :CoRR 2011

Shibamouli Lahiri Xiaofei Lu

Formality is one of the most important dimensions of writing style variation. In this study we conducted an inter-rater reliability experiment for assessing sentence formality on a five-point Likert scale, and obtained good agreement results as well as different rating distributions for different sentence categories. We also performed a difficulty analysis to identify the bottlenecks of our rat...

متن کامل

Epidemiology and indices of gingival and periodontal disease

2003

Sven Poulsen

This paper reviews some of the commonly used indices for measurement of gingivitis and periodontal disease. Periodontal disease should be measured using loss of attachment, not pocket depth. The reliability of several of the indices has been tested. Calibration and training of examiners seems to be an absolute requirement for a satisfactory inter-examiner reliability. Gingival and periodontal d...

متن کامل

ISAKOS Classification of Meniscal Tears. Intra and Interobserver Reliability.

2014

Mariano J. Fresneda Juan J. Dere Carlos H. Yacuzzi Matías Costa Paz

This open-access article is published and distributed under the Creative Commons Attribution NonCommercial No Derivatives License (http://creativecommons.org/licenses/by-nc-nd/3.0/), which permits the noncommercial use, distribution, and reproduction of the article in any medium, provided the original author and source are credited. You may not alter, transform, or build upon this article witho...

متن کامل

Reports Measurement of Ultrasound Biomicroscopy Images: Intraobserver and Interobserver Reliability

2005

Celso Tello Jeffrey Liebmann Seth D. Potash Henry Cohen Robert Ritch

Methods. Four anterior segment images of four normal patients were obtained by a single examiner. The measurements of three independent observers were compared to assess interobserver reproducibility in quantifying the images. Thirteen different anterior segment parameters were measured by each observer on each image. Intraobserver and interobserver reproducibility of measurement were assessed ...

متن کامل

A Typology of Near-Identity Relations for Coreference (NIDENT)

2010

Marta Recasens Eduard H. Hovy Maria Antònia Martí

The task of coreference resolution requires people or systems to decide when two referring expressions refer to the ‘same’ entity or event. In real text, this is often a difficult decision because identity is never adequately defined, leading to contradictory treatment of cases in previous work. This paper introduces the concept of ‘near-identity’, a middle ground category between identity and ...

متن کامل

Commitments to Preferences in Dialogue

2011

Anaïs Cadilhac Nicholas Asher Farah Benamara Alex Lascarides

We propose a method for modelling how dialogue moves influence and are influenced by the agents’ preferences. We extract constraints on preferences and dependencies among them, even when they are expressed indirectly, by exploiting discourse structure. Our method relies on a study of 20 dialogues chosen at random from the Verbmobil corpus. We then test the algorithms predictions against the jud...

متن کامل

Sense Tagging the Penn Treebank

2007

Martha Palmer Hoa Trang Dang Joseph Rosenzweig

This paper describes the methodology that is being used to augment the Penn Treebank annotation with sense tags and other types of semantic information. Inspired by the results of SENSEVAL, and the high inter-annotator agreement that was achieved there, similar methods were used for a pilot study of 5000 words of running text from the Penn Treebank. Using the same techniques of allowing the ann...

متن کامل