Reliability measures in item response theory: manifest versus latent correlation functions.
Abstract
For item response theory (IRT) models, which belong to the class of generalized linear or non-linear mixed models, reliability at the scale of observed scores (i.e., manifest correlation) is more difficult to calculate than latent correlation based reliability, but usually of greater scientific interest. The difficulty arises not least because manifest correlation cannot be calculated explicitly when the logit link is used in conjunction with normal random effects. As such, approximations such as Fisher's information coefficient, Cronbach's α, or the latent correlation are calculated, allegedly because it is easy to do so. Cronbach's α has well-known and serious drawbacks, Fisher's information is not meaningful under certain circumstances, and there is an important but often overlooked difference between latent and manifest correlations. Here, manifest correlation refers to correlation between observed scores, while latent correlation refers to correlation between scores at the latent (e.g., logit or probit) scale. Thus, using one in place of the other can lead to erroneous conclusions. Taylor series based reliability measures, which are based on manifest correlation functions, are derived, and a careful comparison of reliability measures based on latent correlations, Fisher's information, and exact reliability is carried out. The latent correlations are virtually always considerably higher than their manifest counterparts, Fisher's information measure shows no coherent behaviour (it is even negative in some cases), while the newly introduced Taylor series based approximations reflect the exact reliability very closely. Comparisons among the various types of correlations, for various IRT models, are made using algebraic expressions, Monte Carlo simulations, and data analysis. Given the light computational burden and the performance of Taylor series based reliability measures, their use is recommended.
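The gap between latent and manifest correlation can be illustrated with a minimal sketch. It assumes a simple random-intercept logistic model, logit P(Y = 1 | b) = beta + b with b ~ N(0, sigma2), which is chosen here for illustration and is not necessarily one of the IRT models studied in the paper. The function names, the quadrature grid, and the first-order (delta-method) Taylor formula are assumptions of this sketch, not the expressions derived by the authors.

```python
import math


def expit(x):
    """Inverse logit."""
    return 1.0 / (1.0 + math.exp(-x))


def latent_corr(sigma2):
    """Correlation on the logit scale: the residual variance of the
    standard logistic distribution is pi^2 / 3."""
    return sigma2 / (sigma2 + math.pi ** 2 / 3.0)


def manifest_corr(beta, sigma2, n_grid=4001, width=10.0):
    """Correlation between two observed binary responses that share the
    same random intercept, obtained by trapezoid-style numerical
    integration over the normal random effect (hypothetical grid settings)."""
    sigma = math.sqrt(sigma2)
    step = 2.0 * width / (n_grid - 1)
    total = m1 = m2 = 0.0
    for k in range(n_grid):
        z = -width + k * step          # standardized random effect
        w = math.exp(-0.5 * z * z)     # unnormalized normal density
        p = expit(beta + sigma * z)    # conditional success probability
        total += w
        m1 += w * p
        m2 += w * p * p
    m1 /= total                        # E[p]
    m2 /= total                        # E[p^2]
    # Cov(Y1, Y2) = Var(p); Var(Y) = E[p] (1 - E[p])
    return (m2 - m1 * m1) / (m1 * (1.0 - m1))


def taylor_manifest_corr(beta, sigma2):
    """First-order Taylor (delta-method) approximation around b = 0:
    Var(p) ~ v^2 * sigma2 and Var(Y) ~ v^2 * sigma2 + v, with v = p0 (1 - p0)."""
    p0 = expit(beta)
    v = p0 * (1.0 - p0)
    return v * sigma2 / (v * sigma2 + 1.0)


if __name__ == "__main__":
    for s2 in (0.25, 1.0, 4.0):
        print(s2, latent_corr(s2), manifest_corr(0.0, s2), taylor_manifest_corr(0.0, s2))
```

Under these assumptions the latent correlation exceeds the manifest one, in line with the abstract's statement, and the first-order approximation tracks the numerically integrated value most closely when the random-effect variance is small.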
Similar resources
The Comparison of Two Models for Evaluation of Pre-internship Comprehensive Test: Classical and Latent Trait
Introduction: Despite the widespread use of the pre-internship comprehensive test and its importance in the assessment of medical students, few studies provide a systematic psychometric analysis of its items. Thus, the present study sought to assess the March 2011 pre-internship test using classical and latent trait models and to compare their results. Methods: In th...
A mixed-binomial model for Likert-type personality measures
Personality measurement is based on the idea that values on an unobservable latent variable determine the distribution of answers on a manifest response scale. Typically, it is assumed in the Item Response Theory (IRT) that latent variables are related to the observed responses through continuous normal or logistic functions, determining the probability with which one of the ordered response al...
A Geometrical Approach to Item Response Theory
How critical is the concept of the latent trait to modern test theory? The appeal to some unobservable characteristic modulating response probability can lead to some confusion and misunderstanding among users of psychometric technology. This paper looks at a geometric formulation of item response theory that avoids the need to appeal to unobservables. It draws on concepts in differential geom...
Structural equation modeling and its application in psychology studies: a review study
Introduction: Structural Equation Modeling (SEM) is a very general statistical modeling technique, which is widely used in the behavioral sciences. It can be viewed as a combination of path analysis, regression and factor analysis. One of the prominent features of this method is the ability to compute direct, indirect and total effects, as well as latent variable modeling. Methods: This sy...
How Item Response Theory can solve problems of ipsative data
Contents: Introduction. Single-stimulus response format. Response biases affecting single-stimulus items ...
Journal title:
- The British journal of mathematical and statistical psychology
Volume 68, Issue 1
Pages: -
Publication date: 2015