Misclassification Bias in Strategy Research: Analysis and Application
نویسندگان
چکیده
Many issues in strategy and technology management are categorical. Errors in classification of the dependent variable cause systematic bias in regression estimates. In the context of the linear probability model, we characterize the misclassification bias as being proportional to the difference between two measures. One is the “bias” in the explanatory variable, conditional on false negative, multiplied by the probability of false negative. The other is the “bias” in the explanatory variable, conditional on false positive, multiplied by the probability of false positive. Further, the misclassification bias decreases with the variance of the explanatory variable. We apply the theoretical proposition to an analysis of the effect of human capital on the mobility of scientists and engineers. We show that LinkedIn profiles provide more accurate employment histories than patent records. The difference in the estimated coefficients between regressions using employment histories from LinkedIn and patent records exactly equals the bias predicted by our proposition. ∗Corresponding author: Ivan Png, NUS Business School, National University of Singapore, [email protected].
منابع مشابه
Binary Regression With a Misclassified Response Variable in Diabetes Data
Objectives: The categorical data analysis is very important in statistics and medical sciences. When the binary response variable is misclassified, the results of fitting the model will be biased in estimating adjusted odds ratios. The present study aimed to use a method to detect and correct misclassification error in the response variable of Type 2 Diabetes Mellitus (T2DM), applying binary ...
متن کاملAn application of Measurement error evaluation using latent class analysis
Latent class analysis (LCA) is a method of evaluating non sampling errors, especially measurement error in categorical data. Biemer (2011) introduced four latent class modeling approaches: probability model parameterization, log linear model, modified path model, and graphical model using path diagrams. These models are interchangeable. Latent class probability models express l...
متن کاملSensitivity analysis for misclassification in logistic regression via likelihood methods and predictive value weighting.
The potential for bias due to misclassification error in regression analysis is well understood by statisticians and epidemiologists. Assuming little or no available data for estimating misclassification probabilities, investigators sometimes seek to gauge the sensitivity of an estimated effect to variations in the assumed values of those probabilities. We present an intuitive and flexible appr...
متن کاملIs probabilistic bias analysis approximately Bayesian?
Case-control studies are particularly susceptible to differential exposure misclassification when exposure status is determined following incident case status. Probabilistic bias analysis methods have been developed as ways to adjust standard effect estimates based on the sensitivity and specificity of exposure misclassification. The iterative sampling method advocated in probabilistic bias ana...
متن کاملThe effect of misclassification on the estimation of association: a review.
Misclassification, the erroneous measurement of one or several categorical variables, is a major concern in many scientific fields and particularly in psychiatric research. Even in rather simple scenarios, unless the misclassification probabilities are very small, a major bias can arise in estimating the degree of association assessed with common measures like the risk ratio and the odds ratio....
متن کامل