Use of reclassification for assessment of improved prediction: an empirical evaluation.

Authors

  • Ioanna Tzoulaki
  • George Liberopoulos
  • John P A Ioannidis
Abstract

BACKGROUND An increasing number of studies evaluate the ability of predictors to change risk stratification and alter medical decisions, i.e. reclassification performance. We examined the reported design and analysis of recent studies of reclassification and the robustness of their claims for improved reclassification.

METHODS Two independent investigators searched PubMed and citations to the article that introduced the currently most popular reclassification metric (net reclassification index, NRI) to identify studies performing reclassification analysis (January 2006-January 2010). We focused on articles that included any analyses comparing the performance of a baseline predictive model vs the baseline model plus some additional predictor for a prospectively assessed outcome. We recorded information on the baseline model used, outcomes assessed, choice of risk thresholds and features of reclassification analyses.

RESULTS Of 58 baseline models used in 51 eligible papers, only 14 (24%) were previously described, used as described and had same outcomes as originally intended. Calibration was examined in 53% of the studies. Sixteen studies (31%) provided a reference for the choice of risk thresholds and only six used the previously proposed categories or justified the use of alternative thresholds. Only 14 studies (27%) stated that the chosen risk thresholds had different therapeutic intervention implications. NRI was calculated in 38 studies and was smaller in studies with adequately referenced or justified risk thresholds vs others (P < 0.0001).

CONCLUSIONS Reclassification studies would benefit from more rigorous methodological standards; otherwise claims for improved reclassification may remain spurious.
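For readers unfamiliar with the metric, the sketch below shows how a categorical NRI is commonly computed: the net proportion of events moved to a higher risk category plus the net proportion of non-events moved to a lower one. It is an illustration only, not the authors' analysis code. The function names, the example risks, and the 10% and 20% thresholds are assumptions chosen for the example; as the abstract notes, the choice of thresholds should itself be referenced or justified.

```python
from bisect import bisect_right


def risk_category(risk, thresholds):
    """Return the index of the risk category that `risk` falls into.

    `thresholds` must be sorted ascending. A risk equal to a threshold
    is placed in the higher category (an assumption of this sketch).
    """
    return bisect_right(thresholds, risk)


def categorical_nri(old_risks, new_risks, events, thresholds):
    """Categorical net reclassification improvement.

    NRI = [P(up | event) - P(down | event)]
        + [P(down | non-event) - P(up | non-event)]
    where "up"/"down" mean moving to a higher/lower risk category
    when going from the baseline model to the extended model.
    """
    up_e = down_e = up_n = down_n = 0
    n_events = sum(1 for e in events if e)
    n_nonevents = len(events) - n_events
    if n_events == 0 or n_nonevents == 0:
        raise ValueError("need at least one event and one non-event")

    for old, new, event in zip(old_risks, new_risks, events):
        move = risk_category(new, thresholds) - risk_category(old, thresholds)
        if event:
            up_e += move > 0
            down_e += move < 0
        else:
            up_n += move > 0
            down_n += move < 0

    nri_events = (up_e - down_e) / n_events
    nri_nonevents = (down_n - up_n) / n_nonevents
    return nri_events + nri_nonevents


# Hypothetical predicted risks from a baseline and an extended model.
old = [0.05, 0.15, 0.25, 0.08, 0.12, 0.22]
new = [0.12, 0.15, 0.18, 0.04, 0.08, 0.25]
had_event = [1, 1, 1, 0, 0, 0]
print(categorical_nri(old, new, had_event, thresholds=[0.10, 0.20]))
```

Because the result depends directly on where the category boundaries sit, the same pair of models can yield different NRI values under different thresholds, which is why the abstract stresses referenced or justified thresholds.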

Similar resources

Comparison of Bayesian and Frequentist Methods in Estimating the Net Reclassification and Integrated Discrimination Improvement Indices for Evaluation of Prediction Models: Tehran Lipid and Glucose Study

Introduction: The frequency-based method is commonly used to estimate the Net Reclassification Improvement (NRI) and Integrated Discrimination Improvement (IDI) indices. These indices measure how much the performance of a statistical model changes when a new biomarker is added. This method performs poorly in some cases, especially in small samples. In this study, the performance of two Bay...


Improvement of risk prediction by genomic profiling: reclassification measures versus the area under the receiver operating characteristic curve.

Reclassification is observed even when there is no or minimal improvement in the area under the receiver operating characteristic curve (AUC), and it is unclear whether it indicates improved clinical utility. The authors investigated total reclassification, net reclassification improvement, and integrated discrimination improvement for different DeltaAUC using empirical and simulated data. Empi...


Measuring the effectiveness of human resource information systems in National Iranian Oil Company: an empirical assessment

While the growth of MIS investment and its influence is making MIS evaluation ever more indispensable, little attention has been paid to assessing and communicating system effectiveness. This paper attempts to empirically assess the effectiveness of an integrated human resource information system in the Iranian oil industry. As suggested by recent research, the widely accepted IS success model is...


Evaluation of the Incremental Prognostic Utility of Increasingly Complex Testing in Chronic Heart Failure.

BACKGROUND Current heart failure (HF) risk prediction models do not consider how individual patient assessments occur in incremental steps; furthermore, each additional diagnostic evaluation may add cost, complexity, and potential morbidity. METHODS AND RESULTS Using a cohort of well-treated ambulatory HF patients with reduced ejection fraction who had complete clinical, laboratory, health-re...


Reliability Assessment of Shallow Domes Using a Semi-Empirical Evaluation Procedure

Like other structures, shallow domes acquire imperfections during construction that deviate from the values prescribed by specifications. Specifications define tolerance values for these imperfections. Even when these tolerances are respected, the critical load of a dome varies with each imperfection pattern, so reliability plays an important role in dome safety. Theoretical evaluation pro...


Journal title:
  • International Journal of Epidemiology

Volume 40, Issue 4

Pages: -

Publication date: 2011