Estimating the Intra-Rater Reliability of Essay Raters
نویسنده
چکیده
The intra-rater reliability in rating essays is usually indexed by the inter-rater correlation. We suggest an alternative method for estimating intra-rater reliability, in the framework of classical test theory, by using the dis-attenuation formula for inter-test correlations. The validity of the method is demonstrated by extensive simulations, and by applying it to an empirical dataset. It is recommended to use this estimation method whenever the emphasis is not on the average intra-reliability of a group of raters, but when the intrarater reliability of a specific rater is of interest, e.g., when the error-variance component of the scores is of interest in order to estimate true scores.
منابع مشابه
Raters’ Perception and Expertise in Evaluating Second Language Compositions
The consideration of rater training is very important in construct validation of a writing test because it is through training that raters are adapted to the use of students’ writing ability instead of their own criteria for assessing compositions (Charney, 1984). However, although training has been discussed in the literature of writing assessment, there is little research regarding raters’ pe...
متن کاملInter-rater and intra-rater reliability in the interpretation of MTI Photoscreener photographs of Native American preschool children.
PURPOSE To evaluate inter- and intra-rater reliability for the interpretation of MTI Photoscreener photographs taken in a population of Native American preschool children with a high prevalence of astigmatism. METHODS Photographs of 369 children were rated by 11 nonexpert and 3 expert raters. Photographs for each child were scored as pass, refer, or retake. Nonexpert raters scored photos on t...
متن کاملIntra- and inter-rater reliability for judgement of cough following citric acid inhalation.
This study investigated the inter-rater and intra-rater reliability of subjective judgements of cough in patients following inhalation of citric acid. Eleven speech-language pathologists (SLPs) currently using cough reflex testing in their clinical practice (experienced raters) and 34 SLPs with no experience using cough reflex testing (inexperienced raters) were recruited to the study. Particip...
متن کاملFunctional Movement Screen in Elite Boy Basketball Players: A Reliability Study
Purpose: To investigate the reliability of Functional Movement Screen (FMS) in basketball players. A few studies have compared the reliability of FMS between raters with different experience in athletes. The purpose of this study was to compare the FMS scoring between the beginners and expert raters using video records. Methods: This is a cross-sectional study. The study subjects compris...
متن کاملReliability of Body Landmarks Analyzer for Measuring the Quadriceps Angle
Genovarum and Genovalgum are the most common postural deformities of the knee joint. A quadriceps angle is used to measure these anomalies. Methods of measuring this angle are divided into two categories: invasive and non-invasive. The purpose of the present research was to study the inter/intra rater reliability of the non-invasive Body Landmarks Analyzer method for measuring of the quadriceps...
متن کامل