Search results for: inter rater reliability

Number of results: 255783

2017
Maria Pia di Buono Martin Tutek Jan Snajder Goran Glavas Bojana Dalbelo Basic Natasa Milic-Frayling

In this paper, we describe our preliminary study of methods for annotating event mentions as part of our research on high-precision models for event extraction from news. We propose a two-layer annotation scheme, designed to capture the functional and the conceptual aspects of event mentions separately. We hypothesize that the precision can be improved by modeling and extracting the different as...

Journal: Studies in health technology and informatics 2004
Eric Zapletal Christel Daniel-Le Bozec Patrice Degoulet Jean-Marc Guinebretière Marie-Christine Jaulent

In the pathology domain, consensus sessions around multi-headed microscopes enhance reproducibility and can reduce inter- and intra-observer variability. Computerized tools and Web technology could facilitate the organization of consensus sessions and assist pathologists to agree on features that are relevant to diagnosis. In the context of the IDEM project, whose aim is to achieve a computeriz...

2013
Anna Nedoluzhko Jirí Mírovský

In this paper, we present the results of the parallel Czech coreference and bridging annotation in the Prague Dependency Treebank 2.0. The annotation is carried out on dependency trees (on the tectogrammatical layer). We describe the inter-annotator agreement measurement, classify and analyse the most common types of annotators’ disagreement. On two selected long texts, we asked the annotators ...

Journal: CoRR 2011
Shibamouli Lahiri Xiaofei Lu

Formality is one of the most important dimensions of writing style variation. In this study we conducted an inter-rater reliability experiment for assessing sentence formality on a five-point Likert scale, and obtained good agreement results as well as different rating distributions for different sentence categories. We also performed a difficulty analysis to identify the bottlenecks of our rat...

2003
Sven Poulsen

This paper reviews some of the commonly used indices for measurement of gingivitis and periodontal disease. Periodontal disease should be measured using loss of attachment, not pocket depth. The reliability of several of the indices has been tested. Calibration and training of examiners seem to be an absolute requirement for a satisfactory inter-examiner reliability. Gingival and periodontal d...

2014
Mariano J. Fresneda Juan J. Dere Carlos H. Yacuzzi Matías Costa Paz

This open-access article is published and distributed under the Creative Commons Attribution NonCommercial No Derivatives License (http://creativecommons.org/licenses/by-nc-nd/3.0/), which permits the noncommercial use, distribution, and reproduction of the article in any medium, provided the original author and source are credited. You may not alter, transform, or build upon this article witho...

2005
Celso Tello Jeffrey Liebmann Seth D. Potash Henry Cohen Robert Ritch

Methods. Four anterior segment images of four normal patients were obtained by a single examiner. The measurements of three independent observers were compared to assess interobserver reproducibility in quantifying the images. Thirteen different anterior segment parameters were measured by each observer on each image. Intraobserver and interobserver reproducibility of measurement were assessed ...

2010
Marta Recasens Eduard H. Hovy Maria Antònia Martí

The task of coreference resolution requires people or systems to decide when two referring expressions refer to the ‘same’ entity or event. In real text, this is often a difficult decision because identity is never adequately defined, leading to contradictory treatment of cases in previous work. This paper introduces the concept of ‘near-identity’, a middle ground category between identity and ...

2011
Anaïs Cadilhac Nicholas Asher Farah Benamara Alex Lascarides

We propose a method for modelling how dialogue moves influence and are influenced by the agents’ preferences. We extract constraints on preferences and dependencies among them, even when they are expressed indirectly, by exploiting discourse structure. Our method relies on a study of 20 dialogues chosen at random from the Verbmobil corpus. We then test the algorithm’s predictions against the jud...

2007
Martha Palmer Hoa Trang Dang Joseph Rosenzweig

This paper describes the methodology that is being used to augment the Penn Treebank annotation with sense tags and other types of semantic information. Inspired by the results of SENSEVAL, and the high inter-annotator agreement that was achieved there, similar methods were used for a pilot study of 5000 words of running text from the Penn Treebank. Using the same techniques of allowing the ann...
