Net risk reclassification p values: valid or misleading?

نویسندگان

  • Margaret S Pepe
  • Holly Janes
  • Christopher I Li
چکیده

BACKGROUND The Net Reclassification Index (NRI) and its P value are used to make conclusions about improvements in prediction performance gained by adding a set of biomarkers to an existing risk prediction model. Although proposed only 5 years ago, the NRI has gained enormous traction in the risk prediction literature. Concerns have recently been raised about the statistical validity of the NRI. METHODS Using a population dataset of 10000 individuals with an event rate of 10.2%, in which four biomarkers have no predictive ability, we repeatedly simulated studies and calculated the chance that the NRI statistic provides a positive statistically significant result. Subjects for training data (n = 420) and test data (n = 420 or 840) were randomly selected from the population, and corresponding NRI statistics and P values were calculated. For comparison, the change in the area under the receiver operating characteristic curve and likelihood ratio statistics were calculated. RESULTS We found that rates of false-positive conclusions based on the NRI statistic were unacceptably high, being 63.0% in the training datasets and 18.8% to 34.4% in the test datasets. False-positive conclusions were rare when using the change in the area under the curve and occurred at the expected rate of approximately 5.0% with the likelihood ratio statistic. CONCLUSIONS Conclusions about biomarker performance that are based primarily on a statistically significant NRI statistic should be treated with skepticism. Use of NRI P values in scientific reporting should be halted.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improvement of risk prediction by genomic profiling: reclassification measures versus the area under the receiver operating characteristic curve.

Reclassification is observed even when there is no or minimal improvement in the area under the receiver operating characteristic curve (AUC), and it is unclear whether it indicates improved clinical utility. The authors investigated total reclassification, net reclassification improvement, and integrated discrimination improvement for different DeltaAUC using empirical and simulated data. Empi...

متن کامل

Evaluating performance of the spetzler-martin supplemented model in selecting patients with brain arteriovenous malformation for surgery.

BACKGROUND AND PURPOSE Our recently proposed point scoring model includes the widely-used Spetzler-Martin (SM)-5 variables, along with age, unruptured presentation, and diffuse border (SM-Supp). Here we evaluate the SM-Supp model performance compared with SM-5, SM-3, and Toronto prediction models using net reclassification index, which quantifies the correct movement in risk reclassification, a...

متن کامل

Problems with risk reclassification methods for evaluating prediction models.

For comparing the performance of a baseline risk prediction model with one that includes an additional predictor, a risk reclassification analysis strategy has been proposed. The first step is to cross-classify risks calculated according to the 2 models for all study subjects. Summary measures including the percentage of reclassification and the percentage of correct reclassification are calcul...

متن کامل

Practice of Epidemiology Problems With Risk Reclassification Methods for Evaluating Prediction Models

For comparing the performance of a baseline risk prediction model with one that includes an additional predictor, a risk reclassification analysis strategy has been proposed. The first step is to cross-classify risks calculated according to the 2 models for all study subjects. Summary measures including the percentage of reclassification and the percentage of correct reclassification are calcul...

متن کامل

Reproductive Risk Factors and Coronary Heart Disease in the Women's Health Initiative Observational Study.

BACKGROUND Reproductive factors provide an early window into a woman's coronary heart disease (CHD) risk; however, their contribution to CHD risk stratification is uncertain. METHODS AND RESULTS In the Women's Health Initiative Observational Study, we constructed Cox proportional hazards models for CHD including age, pregnancy status, number of live births, age at menarche, menstrual irregula...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of the National Cancer Institute

دوره 106 4  شماره 

صفحات  -

تاریخ انتشار 2014