rater reliability

ONYX: A System for the Semantic Analysis of Clinical Text

2009

Lee M. Christensen Henk Harkema Peter J. Haug Jeannie Yuhaniak Irwin Wendy W. Chapman

This paper introduces ONYX, a sentencelevel text analyzer that implements a number of innovative ideas in syntactic and semantic analysis. ONYX is being developed as part of a project that seeks to translate spoken dental examinations directly into chartable findings. ONYX integrates syntax and semantics to a high degree. It interprets sentences using a combination of probabilistic classifiers,...

متن کامل

Towards the Orwellian Nightmare: Separation of Business and Personal Emails

2006

Sanaz Jabbari Ben Allison David Guthrie Louise Guthrie

This paper describes the largest scale annotation project involving the Enron email corpus to date. Over 12,500 emails were classified, by humans, into the categories “Business” and “Personal”, and then subcategorised by type within these categories. The paper quantifies how well humans perform on this task (evaluated by inter-annotator agreement). It presents the problems experienced with the ...

متن کامل

Universidade de aveiro's voice evaluation protocol

2009

Luis M. T. Jesus Anna Barney Ricardo Santos Janine Caetano Juliana Jorge Pedro Sá-Couto

This paper presents Universidade de Aveiro’s Voice Evaluation Protocol for European Portuguese (EP), and a preliminary inter-rater reliability study. Ten patients with vocal pathology were assessed, by two Speech and Language Therapists (SLTs). Protocol parameters such as overall severity, roughness, breathiness, change of loudness (CAPEV), grade, breathiness and strain (GRBAS), glottal attack,...

متن کامل

The BECauSE Corpus 2.0: Annotating Causality and Overlapping Relations

2017

Jesse Dunietz Lori S. Levin Jaime G. Carbonell

Language of cause and effect captures an essential component of the semantics of a text. However, causal language is also intertwined with other semantic relations, such as temporal precedence and correlation. This makes it difficult to determine when causation is the primary intended meaning. This paper presents BECauSE 2.0, a new version of the BECauSE corpus with exhaustively annotated expre...

متن کامل

Improving Reliability of Word Similarity Evaluation by Redesigning Annotation Task and Performance Measure

2016

Oded Avraham Yoav Goldberg

We suggest a new method for creating and using gold-standard datasets for word similarity evaluation. Our goal is to improve the reliability of the evaluation, and we do this by redesigning the annotation task to achieve higher inter-rater agreement, and by defining a performance measure which takes the reliability of each annotation decision in the dataset into account.

متن کامل

Erlangen-CLP: A Large Annotated Corpus of Speech from Children with Cleft Lip and Palate

2014

Tobias Bocklet Andreas K. Maier Korbinian Riedhammer Ulrich Eysholdt Elmar Nöth

In this paper we describe Erlangen-CLP, a large speech database of children with Cleft Lip and Palate. More than 800 German children with CLP (most of them between 4 and 18 years old) and 380 age matched control speakers spoke the semi-standardized PLAKSS test that consists of words with all German phonemes in different positions. So far 250 CLP speakers were manually transcribed, 120 of these ...

متن کامل

[A systematic social observation tool: methods and results of inter-rater reliability].

Journal: :Cadernos de saude publica 2013

Eulilian Dias de Freitas Vitor Passos Camargos César Coelho Xavier Waleska Teixeira Caiaffa Fernando Augusto Proietti

Systematic social observation has been used as a health research methodology for collecting information from the neighborhood physical and social environment. The objectives of this article were to describe the operationalization of direct observation of the physical and social environment in urban areas and to evaluate the instrument's reliability. The systematic social observation instrument ...

متن کامل

THE ASSESSMENT OF INTERRATER AGREEMENT FOR MULTIPLE ATTRIBUTE RESPONSES by

2008

Lawrence L. Kupper Kerry B. Hafner

متن کامل

Assessing performance outcomes of new graduates utilizing simulation in a military transition program.

Journal: :Journal for nurses in professional development 2013

Robie V Hughes Sherrill J Smith Clair M Sheffield Grady Wier

This multi-site, quasi-experimental study examined the performance outcomes of nurses (n = 152) in a military nurse transition program. A modified-performance instrument was used to assess participants in two high-fidelity simulation scenarios. Although results indicated a significant increase in scores posttraining, only moderate interrater reliability results were found for the new instrument...

متن کامل

Intra- and inter-rater reliability of a modified measure of hand behind back range of motion.

Journal: :Manual therapy 2014

Paul A van den Dolder Paulo H Ferreira Kathryn Refshauge

The aim of this reliability study was to identify the clinimetric properties, specifically intra- and inter-rater reliability, for measuring the functionally and clinically important hand behind back (combined shoulder internal rotation/adduction and elbow flexion) range of motion using a modified technique. Sixty asymptomatic participants (20 male, 40 female) aged 45.4 ± 11.7 years (mean ± SD)...

متن کامل