Exact Analysis of Squared Cross-Validity Coefficient in Predictive Regression Models.
نویسنده
چکیده
In regression analysis, the notion of population validity is of theoretical interest for describing the usefulness of the underlying regression model, whereas the presumably more important concept of population cross-validity represents the predictive effectiveness for the regression equation in future research. It appears that the inference procedures of the squared multiple correlation coefficient have been extensively developed. In contrast, a full range of statistical methods for the analysis of the squared cross-validity coefficient is considerably far from complete. This article considers a distinct expression for the definition of the squared cross-validity coefficient as the direct connection and monotone transformation to the squared multiple correlation coefficient. Therefore, all the currently available exact methods for interval estimation, power calculation, and sample size determination of the squared multiple correlation coefficient are naturally modified and extended to the analysis of the squared cross-validity coefficient. The adequacies of the existing approximate procedures and the suggested exact method are evaluated through a Monte Carlo study. Furthermore, practical applications in areas of psychology and management are presented to illustrate the essential features of the proposed methodologies. The first empirical example uses 6 control variables related to driver characteristics and traffic congestion and their relation to stress in bus drivers, and the second example relates skills, cognitive performance, and personality to team performance measures. The results in this article can facilitate the recommended practice of cross-validation in psychological and other areas of social science research.
منابع مشابه
A Novel QSAR Model for the Evaluation and Prediction of (E)-N’-Benzylideneisonicotinohydrazide Derivatives as the Potent Anti-mycobacterium Tuberculosis Antibodies Using Genetic Function Approach
Abstract A dataset of (E)-N’-benzylideneisonicotinohydrazide derivatives as a potent anti-mycobacterium tuberculosis has been investigated utilizing Quantitative Structure-Activity Relationship (QSAR) techniques. Genetic Function Algorithm (GFA) and Multiple Linear Regression Analysis (MLRA) were used to select the descriptors and to generate the correlation QSAR models that relate the Mi...
متن کاملPrediction of Patient Controlled Analgesic Consumption Using Patient Demand Behaviours
Many factors affect individual variability in postoperative pain. Although several statistical studies have evaluated postoperative pain and analgesic consumption, previous research shows that the coefficient of determination of existing predictive models was small (e.g., R = 0.17–0.59 for postoperative pain, and 0.27–0.46 for postoperative analgesic consumption). This study presents the real-w...
متن کاملPredictive validity of the comprehensive basic science examination mean score for assessment of medical students’ performance
Introduction. Medical education curriculum improvements can be achieved by evaluating students’ performance. Medical students have to pass two undergraduate comprehensive examinations, basic science and preinternship, in Iran. To measure validity of the students’ mean score in comprehensive basic science exam (CBSE) for predicting their performance in later curriculum phases. Methods. This de...
متن کاملPackage 'cvq2' Title Calculate the Predictive Squared Correlation Coefficient
February 19, 2015 Type Package Title Calculate the predictive squared correlation coefficient Version 1.2.0 Date 2013-10-10 Author Torsten Thalheim Maintainer Torsten Thalheim Description The external prediction capability of quantitative structure-activity relationship (QSAR) models is often quantified using the predictive squared correlation coefficient. This value ca...
متن کاملSupport Vector Regression Model of Chlorophyll-a during Spring Algal Bloom in Xiangxi Bay of Three Gorges Reservoir, China
To study the relationship between chlorophyll-a and environmental variables during spring algal bloom in Xiangxi Bay of Three Gorges Reservoir, the support vector regression (SVR) model was established. In surveys, 11 stations have been investigated and 264 samples were collected weekly from March 4 to May 13 in 2007 and February 16 to May 10 in 2008. The parameters in SVR model were optimized ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Multivariate behavioral research
دوره 44 1 شماره
صفحات -
تاریخ انتشار 2009