نتایج جستجو برای: inter rater reliability

تعداد نتایج: 255783  

2006
Feng Pan Rutu Mulkar-Mehta Jerry R. Hobbs

In this paper, we present our work on generating an annotated corpus for extracting information about the typical durations of events from texts. We include the annotation guidelines, the event classes we categorized, the way we use normal distributions to model vague and implicit temporal information, and how we evaluate inter-annotator agreement. The experimental results show that our guideli...

2014
Tibor Kiss Francis Jeffry Pelletier Tobias Stadtfeld

The present paper describes the construction of a resource to determine the lexical preference class of a large number of English nouns (≈ 14,000) with respect to the distinction between mass and count interpretations. In constructing the lexicon, we have employed a questionnaire-based approach based on existing resources such as the Open ANC (http://www.anc.org) and WordNet (Miller, 1995). The...

2014
Per Erik Solberg Arne Skjærholt Lilja Øvrelid Kristin Hagen Janne Bondi Johannessen

The Norwegian Dependency Treebank is a new syntactic treebank for Norwegian Bokmål and Nynorsk with manual syntactic and morphological annotation, developed at the National Library of Norway in collaboration with the University of Oslo. It is the first publically available treebank for Norwegian. This paper presents the core principles behind the syntactic annotation and how these principles we...

Journal: :The Journal of the Acoustical Society of America 1994
J Kreiman B R Gerratt G S Berke

Although the terms "breathy" and "rough" are frequently applied to pathological voices, widely accepted definitions are not available and the relationship between these qualities is not understood. To investigate these matters, expert listeners judged the dissimilarity of pathological voices with respect to breathiness and roughness. A second group of listeners rated the voices on unidimensiona...

2012
Yann Mathet Antoine Widlöcher Karën Fort Claire François Olivier Galibert Cyril Grouin Juliette Kahn Sophie Rosset Pierre Zweigenbaum

Computing inter-annotator agreement measures on a manually annotated corpus is necessary to evaluate the reliability of its annotation. However, the interpretation of the obtained results is recognized as highly arbitrary. We describe in this article a method and a tool that we developed which “shuffles” a reference annotation according to different error paradigms, thereby creating artificial ...

2010
Patrick Paroubek Alexander Pak Djamel Mostefa

After presenting opinion and sentiment analysis state of the art and the DOXA project, we review the few evaluation campaigns that have dealt in the past with opinion mining. Then we present the two level opinion and sentiment model that we will use for evaluation in the DOXA project and the annotation interface we use for hand annotating a reference corpus. We then present the corpus which wil...

2013
Shu Cai Kevin Knight

The evaluation of whole-sentence semantic structures plays an important role in semantic parsing and large-scale semantic structure annotation. However, there is no widely-used metric to evaluate wholesentence semantic structures. In this paper, we present smatch, a metric that calculates the degree of overlap between two semantic feature structures. We give an efficient algorithm to compute th...

2013
Debanka Nandi Maaz Nomani Himanshu Sharma Himani Chaudhary Sambhav Jain Dipti Misra Sharma

The paper presents our work on the annotation of intra-chunk dependencies on an English treebank that was previously annotated with Inter-chunk dependencies, and for which there exists a fully expanded parallel Hindi dependency treebank. This provides fully parsed dependency trees for the English treebank. We also report an analysis of the inter-annotator agreement for this chunk expansion task...

Journal: :Journal of personality assessment 2007
Mel Hamel Thomas W Shaffer

Building on our previously published study (Hamel, Shaffer, & Erdberg, 2000), which provided data on 100 nonpatient children aged 6 to 12 from the United States, we here provide reference data for two more homogeneous age subgroups: 6 to 9 (N = 50) and 10 to 12 (N = 50). Inclusion criteria are described, and expanded interrater reliability statistics at the response level are presented along wi...

2008
Georgiana Puscasu Verginica Barbu Mititelu

This paper reports on the annotation of all English verbs included in WordNet 2.0 with TimeML event classes. Two annotators assign each verb present in WordNet the most relevant event class capturing most of that verb’s meanings. At the end of the annotation process, inter-annotator agreement is measured using kappa statistics, yielding a kappa value of 0.87. The cases of disagreement between t...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید