rater reliability

An Annotated Corpus of Typical Durations of Events

2006

Feng Pan Rutu Mulkar-Mehta Jerry R. Hobbs

In this paper, we present our work on generating an annotated corpus for extracting information about the typical durations of events from texts. We include the annotation guidelines, the event classes we categorized, the way we use normal distributions to model vague and implicit temporal information, and how we evaluate inter-annotator agreement. The experimental results show that our guideli...

متن کامل

Building a reference lexicon for countability in English

2014

Tibor Kiss Francis Jeffry Pelletier Tobias Stadtfeld

The present paper describes the construction of a resource to determine the lexical preference class of a large number of English nouns (≈ 14,000) with respect to the distinction between mass and count interpretations. In constructing the lexicon, we have employed a questionnaire-based approach based on existing resources such as the Open ANC (http://www.anc.org) and WordNet (Miller, 1995). The...

متن کامل

Reliability of videotaped observational gait analysis in patients with orthopedic impairments

Journal: :BMC Musculoskeletal Disorders 2005

Jaap J Brunnekreef Caro JT van Uden Steven van Moorsel Jan GM Kooloos

BACKGROUND In clinical practice, visual gait observation is often used to determine gait disorders and to evaluate treatment. Several reliability studies on observational gait analysis have been described in the literature and generally showed moderate reliability. However, patients with orthopedic disorders have received little attention. The objective of this study is to determine the reliabi...

متن کامل

Assessing Reliability of Medical Record Reviews for the Detection of Hospital Adverse Events

2015

Minsu Ock Sang-il Lee Min-Woo Jo Jin Yong Lee Seon-Ha Kim

OBJECTIVES The purpose of this study was to assess the inter-rater reliability and intra-rater reliability of medical record review for the detection of hospital adverse events. METHODS We conducted two stages retrospective medical records review of a random sample of 96 patients from one acute-care general hospital. The first stage was an explicit patient record review by two nurses to detec...

متن کامل

The Vancouver Scar Scale: an administration tool and its interrater reliability.

Journal: :The Journal of burn care & rehabilitation 1995

M J Baryza G A Baryza

The Burn Scar Index, often called the Vancouver Scar Scale, is widely used in clinical practice and research to document change in scar appearance. Several sections of the Index require equipment to accurately score the items. Additionally, the numeric scores are difficult to remember. We recently devised a pocket-sized tool to aid in scoring the scar and to increase staff compliance in use of ...

متن کامل

A review and update of the Health of the Nation Outcome Scales (HoNOS).

Journal: :BJPsych bulletin 2018

Mick James Jon Painter Bill Buckingham Malcolm W Stewart

Aims and method The Health of the Nation Outcome Scales (HoNOS) and its older adults' version (HoNOS 65+) have been used widely for 20 years, but their glossaries have not been revised to reflect clinicians' experiences or changes in service delivery. The Royal College of Psychiatrists convened an international advisory board, with UK, Australian and New Zealand expertise, to identify desirable...

متن کامل

Collecting Reliable Human Judgements on Machine-Generated Language: The Case of the QG-STEC Data

2016

Keith Godwin Paul Piwek

Question generation (QG) is the problem of automatically generating questions from inputs such as declarative sentences. The Shared Evaluation Task Challenge (QG-STEC) Task B that took place in 2010 evaluated several state-of-the-art QG systems. However, analysis of the evaluation results was affected by low inter-rater reliability. We adapted Nonaka & Takeuchi’s knowledge creation cycle to the...

متن کامل

Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation

2016

Borja Navarro-Colorado María Ribes-Lafoz Noelia Sánchez

In order to analyze metrical and semantics aspects of poetry in Spanish with computational techniques, we have developed a large corpus annotated with metrical information. In this paper we will present and discuss the development of this corpus: the formal representation of metrical patterns, the semi-automatic annotation process based on a new automatic scansion system, the main annotation pr...

متن کامل

Annotating dropped pronouns in Chinese newswire text

2012

Elizabeth Baran Yaqin Yang Nianwen Xue

We propose an annotation framework to explicitly identify dropped subject pronouns in Chinese. We acknowledge and specify 10 concrete pronouns that exist as words in Chinese and 4 abstract pronouns that do not correspond to Chinese words, but that are recognized conceptually, to native Chinese speakers. These abstract pronouns are identified as “unspecified”, “pleonastic”, “event”, and “existen...

متن کامل

360’ Ratings: an Analysis of Assumptions and a Research Agenda for Evaluating Their Validity

2001

Walter C. Borman

This article argues that assumptions surrounding 360” ratings should be examined; most notably, the assumptions that different rating sources have relatively unique perspectives on performance and multiple rating sources provide incremental validity over the individual sources. Studies generally support the first assumption, although reasons for interrater disagreement across different organiza...

متن کامل