A Review of Methods for Evaluating the Fit of Item Score Patterns on a Test
نویسندگان
چکیده
DOCUMENT RESUME Meijer, Rob A Review of Patterns on Twente Univ. Science and 1999-00-00 55p. Faculty of Educational Science and Technology, University of Twente, P.O. Box 217, 7500 AE Enschede, The Netherlands. Reports Descriptive (141) MF01/PC03 Phis Postage. *Evaluation Methods; *Goodness of Fit; *Item Response Theory; Personality Measures; *Scores; Test Construction; *Test Items *Person Fit Measures TM 030 112 R.; Sijtsma, Klaas Methods for Evaluating the Fit of Item Score a Test. Research Report 99-01. , Enschede (Netherlands). Faculty of Educational Technology. Methods are discussed that can be used to investigate the fit of an item score pattern to a test model. Model-based tests and personality inventories are administered to more than 100 million people a year and, as a result, individual fit is of great concern. Item Response Theory (IRT) modeling and person-fit statistics that are formulated in the context of IRT take a prominent place in the literature. Person-fit statistics are extensively discussed in this paper. Also, methods formulated outside the IRT context and methods to investigate particular types of response behavior are discussed. The aim of this paper is to give the researcher an idea of the possibilities in this research area by emphasizing the similarities of most person-fit methods and by discussing the pros and cons of the methods. (Contains 98 references and a list of University of Twente research reports.) (Author/SLD) ******************************************************************************** * Reproductions supplied by EDRS are the best that can be made * * from the original document. * ********************************************************************************
منابع مشابه
Diagnosing item score patterns on a test using item response theory-based person-fit statistics.
Person-fit statistics have been proposed to investigate the fit of an item score pattern to an item response theory (IRT) model. The author investigated how these statistics can be used to detect different types of misfit. Intelligence test data were analyzed using person-fit statistics in the context of the G. Rasch (1960) model and R. J. Mokken's (1971, 1997) IRT models. The effect of the cho...
متن کاملDetection of Aberrant Item Score Patterns : A Review
Methods for detecting item score patterns that are unlikely (aberrant) given that a parametric item response theory (IRT) model gives an adequate description of the data or given the responses of the other persons in the group are discussed. The emphasis here is on the latter group of statistics. These statistics can be applied when a nonparametric model is used to fit the data or when the data...
متن کاملReview Psychometric Parameters of the 29th Residency Test (1380) According to the Classic Test Theory (CTT)
Introduction. To select the best group, and to make a good decision, are of the most important worries of the health and medical education ministry and also all entrants in the residency test. Having and performing a reliable and good exam will reduce doubts to a great deal. Considering different scientific methods consist of (precisely review of curriculum by the designer committee, sampling o...
متن کاملDeveloping and Validating Tool for Assessing the Field Internship Course in the Field of Occupational Health Engineering
Background and aims: Training process is any learning-based activity and experience, performed with the aim of causing relatively fixed and stable changes in people to improve their ability to do a job. In this regard, the apprenticeship course is one of the most important educational courses, especially for action-oriented or practical fields of studies of the universities. In these fields, a...
متن کاملSelecting the Best Fit Model in Cognitive Diagnostic Assessment: Differential Item Functioning Detection in the Reading Comprehension of the PhD Nationwide Admission Test
This study was an attemptto provide detailed information of the strengths and weaknesses of test takers‟ real ability through cognitive diagnostic assessment, and to detect differential item functioning in each test item. The rationale for using CDA was that it estimates an item‟s discrimination power, whereas clas- sical test theory or item response theory depicts between rather within item mu...
متن کامل