نتایج جستجو برای: rater reliability

تعداد نتایج: 145715  

2009
Lee M. Christensen Henk Harkema Peter J. Haug Jeannie Yuhaniak Irwin Wendy W. Chapman

This paper introduces ONYX, a sentencelevel text analyzer that implements a number of innovative ideas in syntactic and semantic analysis. ONYX is being developed as part of a project that seeks to translate spoken dental examinations directly into chartable findings. ONYX integrates syntax and semantics to a high degree. It interprets sentences using a combination of probabilistic classifiers,...

2006
Sanaz Jabbari Ben Allison David Guthrie Louise Guthrie

This paper describes the largest scale annotation project involving the Enron email corpus to date. Over 12,500 emails were classified, by humans, into the categories “Business” and “Personal”, and then subcategorised by type within these categories. The paper quantifies how well humans perform on this task (evaluated by inter-annotator agreement). It presents the problems experienced with the ...

2009
Luis M. T. Jesus Anna Barney Ricardo Santos Janine Caetano Juliana Jorge Pedro Sá-Couto

This paper presents Universidade de Aveiro’s Voice Evaluation Protocol for European Portuguese (EP), and a preliminary inter-rater reliability study. Ten patients with vocal pathology were assessed, by two Speech and Language Therapists (SLTs). Protocol parameters such as overall severity, roughness, breathiness, change of loudness (CAPEV), grade, breathiness and strain (GRBAS), glottal attack,...

2017
Jesse Dunietz Lori S. Levin Jaime G. Carbonell

Language of cause and effect captures an essential component of the semantics of a text. However, causal language is also intertwined with other semantic relations, such as temporal precedence and correlation. This makes it difficult to determine when causation is the primary intended meaning. This paper presents BECauSE 2.0, a new version of the BECauSE corpus with exhaustively annotated expre...

2016
Oded Avraham Yoav Goldberg

We suggest a new method for creating and using gold-standard datasets for word similarity evaluation. Our goal is to improve the reliability of the evaluation, and we do this by redesigning the annotation task to achieve higher inter-rater agreement, and by defining a performance measure which takes the reliability of each annotation decision in the dataset into account.

2014
Tobias Bocklet Andreas K. Maier Korbinian Riedhammer Ulrich Eysholdt Elmar Nöth

In this paper we describe Erlangen-CLP, a large speech database of children with Cleft Lip and Palate. More than 800 German children with CLP (most of them between 4 and 18 years old) and 380 age matched control speakers spoke the semi-standardized PLAKSS test that consists of words with all German phonemes in different positions. So far 250 CLP speakers were manually transcribed, 120 of these ...

Journal: :Cadernos de saude publica 2013
Eulilian Dias de Freitas Vitor Passos Camargos César Coelho Xavier Waleska Teixeira Caiaffa Fernando Augusto Proietti

Systematic social observation has been used as a health research methodology for collecting information from the neighborhood physical and social environment. The objectives of this article were to describe the operationalization of direct observation of the physical and social environment in urban areas and to evaluate the instrument's reliability. The systematic social observation instrument ...

Journal: :Journal for nurses in professional development 2013
Robie V Hughes Sherrill J Smith Clair M Sheffield Grady Wier

This multi-site, quasi-experimental study examined the performance outcomes of nurses (n = 152) in a military nurse transition program. A modified-performance instrument was used to assess participants in two high-fidelity simulation scenarios. Although results indicated a significant increase in scores posttraining, only moderate interrater reliability results were found for the new instrument...

Journal: :Manual therapy 2014
Paul A van den Dolder Paulo H Ferreira Kathryn Refshauge

The aim of this reliability study was to identify the clinimetric properties, specifically intra- and inter-rater reliability, for measuring the functionally and clinically important hand behind back (combined shoulder internal rotation/adduction and elbow flexion) range of motion using a modified technique. Sixty asymptomatic participants (20 male, 40 female) aged 45.4 ± 11.7 years (mean ± SD)...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید