Search results for: rater reliability

Number of results: 145715

2006
Feng Pan Rutu Mulkar-Mehta Jerry R. Hobbs

In this paper, we present our work on generating an annotated corpus for extracting information about the typical durations of events from texts. We include the annotation guidelines, the event classes we categorized, the way we use normal distributions to model vague and implicit temporal information, and how we evaluate inter-annotator agreement. The experimental results show that our guideli...

2014
Tibor Kiss Francis Jeffry Pelletier Tobias Stadtfeld

The present paper describes the construction of a resource to determine the lexical preference class of a large number of English nouns (≈ 14,000) with respect to the distinction between mass and count interpretations. In constructing the lexicon, we have employed a questionnaire-based approach based on existing resources such as the Open ANC (http://www.anc.org) and WordNet (Miller, 1995). The...

Journal: BMC Musculoskeletal Disorders 2005
Jaap J Brunnekreef Caro JT van Uden Steven van Moorsel Jan GM Kooloos

BACKGROUND In clinical practice, visual gait observation is often used to determine gait disorders and to evaluate treatment. Several reliability studies on observational gait analysis have been described in the literature and generally showed moderate reliability. However, patients with orthopedic disorders have received little attention. The objective of this study is to determine the reliabi...

2015
Minsu Ock Sang-il Lee Min-Woo Jo Jin Yong Lee Seon-Ha Kim

OBJECTIVES The purpose of this study was to assess the inter-rater reliability and intra-rater reliability of medical record review for the detection of hospital adverse events. METHODS We conducted a two-stage retrospective medical record review of a random sample of 96 patients from one acute-care general hospital. The first stage was an explicit patient record review by two nurses to detec...

Journal: The Journal of burn care & rehabilitation 1995
M J Baryza G A Baryza

The Burn Scar Index, often called the Vancouver Scar Scale, is widely used in clinical practice and research to document change in scar appearance. Several sections of the Index require equipment to accurately score the items. Additionally, the numeric scores are difficult to remember. We recently devised a pocket-sized tool to aid in scoring the scar and to increase staff compliance in use of ...

Journal: BJPsych bulletin 2018
Mick James Jon Painter Bill Buckingham Malcolm W Stewart

Aims and method The Health of the Nation Outcome Scales (HoNOS) and its older adults' version (HoNOS 65+) have been used widely for 20 years, but their glossaries have not been revised to reflect clinicians' experiences or changes in service delivery. The Royal College of Psychiatrists convened an international advisory board, with UK, Australian and New Zealand expertise, to identify desirable...

2016
Keith Godwin Paul Piwek

Question generation (QG) is the problem of automatically generating questions from inputs such as declarative sentences. The Shared Evaluation Task Challenge (QG-STEC) Task B that took place in 2010 evaluated several state-of-the-art QG systems. However, analysis of the evaluation results was affected by low inter-rater reliability. We adapted Nonaka & Takeuchi’s knowledge creation cycle to the...

2016
Borja Navarro-Colorado María Ribes-Lafoz Noelia Sánchez

In order to analyze metrical and semantic aspects of poetry in Spanish with computational techniques, we have developed a large corpus annotated with metrical information. In this paper we will present and discuss the development of this corpus: the formal representation of metrical patterns, the semi-automatic annotation process based on a new automatic scansion system, the main annotation pr...

2012
Elizabeth Baran Yaqin Yang Nianwen Xue

We propose an annotation framework to explicitly identify dropped subject pronouns in Chinese. We acknowledge and specify 10 concrete pronouns that exist as words in Chinese and 4 abstract pronouns that do not correspond to Chinese words but that are recognized conceptually by native Chinese speakers. These abstract pronouns are identified as “unspecified”, “pleonastic”, “event”, and “existen...

2001
Walter C. Borman

This article argues that assumptions surrounding 360° ratings should be examined; most notably, the assumptions that different rating sources have relatively unique perspectives on performance and multiple rating sources provide incremental validity over the individual sources. Studies generally support the first assumption, although reasons for interrater disagreement across different organiza...
