Comparison of ordinary logistic regression and robust logistic regression models in modeling of pre-diabetes risk factors

Authors

  • Ali Taghipour Department of Epidemiology, Social Determinants of Health Research Center, Mashhad University of Medical Sciences, Mashhad, Iran.
  • Fateme Azizi Mayvan Department of Public Health, Neyshabur University of Medical Sciences, Neyshabur, Iran.
  • Mahsa Mokarram Department of Demographics, Student Research Committee, Islamic Azad University, Central Tehran Branch, Tehran, Iran.
  • Mehdi Jabbari Nooghabi Department of Statistics, School of Mathematics, Ferdowsi University, Mashhad, Iran.
  • Mohammad Taghi Shakeri Department of Biostatistics, Social Determinants of Health Research Center, Mashhad University of Medical Sciences, Mashhad, Iran.
Abstract:

Background: Regarding the increased risk of developing type 2 diabetes in pre-diabetic people, identifying pre-diabetes and determining of its risk factors seems so necessary. In this study, it is aimed to compare ordinary logistic regression and robust logistic regression models in modeling pre-diabetes risk factors. Methods: This is a cross-sectional study and conducted on 6460 people, over 30 years old, who have participated in the screening of diabetes plan in Mashhad city that it was done by Mashhad University of Medical Sciences from October to December 2010. According to the fasting blood sugar criteria, 5414 individuals were identified as healthy and 1046 individuals were identified as pre-diabetic. Age, gender, body mass index, systolic blood pressure, diastolic blood pressure and waist-to-hip ratio were measured for every participant. The data was entered into the Microsoft Excel 2013 (Microsoft Corp., Redmond, WA, USA) and then analysis of the data was done in R Project for Statistical Computing, Version R 3.1.2 (www.r-project.org). Ordinary logistic regression model was fitted on the data. The outliers were identified. Then Mallow, WBY and BY robust logistic regression models were fitted on the data. And then, the robust models were compared with each other and with ordinary logistic regression model according to goodness of fit and prediction ability using Pearson's chi-square and area under the receiver operating characteristic (ROC) curve respectively. Results: Among the variables that were included in the ordinary logistic regression model and three robust logistic models, age, body mass index and systolic blood pressure were statistically significant (P< 0.01) but waist-to-hip ratio was not statistically significant (P> 0.1). There were 552 outliers with misclassification error in the ordinary logistic regression model. Pearson's chi-square value and area under the ROC curve value in the Mallow model were almost the same as for ordinary logistic regression model. But it was relatively higher in BY and WBY models. Conclusion: Based on results of this study age, overweight and hypertension are risk factors of prediabetes. Also, WBY and BY models were better than ordinary logistic regression model, according to goodness of fit criteria and prediction ability.

Upgrade to premium to download articles

Sign up to access the full text

Already have an account?login

similar resources

cumulative logistic regression vs ordinary logistic regression

The common practice of collapsing inherently continuous or ordinal variables into two categories causes information loss that may potentially weaken power to detect effects of explanatory variables and result in Type II errors in statistical inference. The purpose of this investigation was to illustrate, using a substantive example, the potential increase in power gained from an ordinal instead...

full text

Predicting Type Two Diabetes and Determination of Effectiveness of Risk Factors Applying Logistic Regression Model

Background & Aim: Diabetes is one of the chronic diseases with no curative treatment; also, it is the most common cause of amputation, blindness and chronic renal failure and the most important risk factor of heart diseases. Logistic regression is one of the statistical analysis models for predicting that can be used to find out the relationship between dependent and predictor independent varia...

full text

Modeling the Risk factors of hypertension in 35-65 years old individuals using logistic regression

Introduction Hypertension is a common cause of cardiovascular disease in the world. Therefore identification of risk factors for hypertension is essential to carry out preventive masseurs. So this study was done with the aim of using logistic regression model to determine and assess the risk factors of hypertension, in Mashhad. Materials & Methods This Cross sectional study was carried out us...

full text

Multiple Logistic Regression and Model Fit Multiple Logistic Regression Just as in OLS regression, logistic models

Multiple Logistic Regression Just as in OLS regression, logistic models can include more than one predictor. The analysis options are similar to regression. One can choose to select variables, as with a stepwise procedure, or one can enter the predictors simultaneously, or they can be entered in blocks. Variations of the likelihood ratio test can be conducted in which the chi-square test (G) is...

full text

Distributionally Robust Logistic Regression

This paper proposes a distributionally robust approach to logistic regression. We use the Wasserstein distance to construct a ball in the space of probability distributions centered at the uniform distribution on the training samples. If the radius of this ball is chosen judiciously, we can guarantee that it contains the unknown datagenerating distribution with high confidence. We then formulat...

full text

My Resources

Save resource for easier access later

Save to my library Already added to my library

{@ msg_add @}


Journal title

volume 76  issue 7

pages  452- 458

publication date 2018-10

By following a journal you will be notified via email when a new issue of this journal is published.

Keywords

Hosted on Doprax cloud platform doprax.com

copyright © 2015-2023