How to use SAS® to fit Multiple Logistic Regression Models
نویسنده
چکیده
When response outcomes are continuous error terms in models are normally distributed and a standard normal distribution function is adequate. The logistic distribution function which is very similar to the normal distribution function is required when the response variable is binary. Parameters of a logistic response function are often estimated using the method of maximum likelihood (ML). One of the problems with ML estimation is that, no closed-form solution exists for the values of the parameters that maximize the loglikelihood function. Hence sophisticated computer-intensive numerical search procedures (i.e: Newton Raphson) are required to find ML estimates of parameters. This paper is a step by step guide to develop a multiple logistic regression model for data sets with binary response variable using PROC LOGISTIC in SAS®. Since PROC LOGISTIC requires uniform coding and does not accommodate missing data, data need be corrected for missing values and for outliers, those can reduce the efficiency of ML estimation. In addition, best subset among 25 predictor variables was selected. This step is useful to increase the speed of ML estimation. Complete model diagnostics tests, sensitivity and residual analyses were performed. ML parameter estimation was significant at 5%level. P-value of Hosmer and Lemeshow goodness of fit test was 0.9545. Model had a maximum adjusted R-square value equal to 0.78, showed 93.2% sensitivity and 79.5% specificity. Residuals from the model did not show significant patterns and were normally distributed. Odds ratios of the predictor variables supported the conclusions drawn from ML estimation.
منابع مشابه
Hierarchical Logistic Regression Modeling with SAS GLIMMIX
Data often have hierarchical or clustered structures, such as patients clustered within hospitals or students nested within schools. Hierarchical models are statistical models that are used to analyze hierarchical or multilevel data. SAS GLIMMIX procedure is a new and highly useful tool for hierarchical modeling with discrete responses. This paper is focused on hierarchical logistic regression ...
متن کاملFitting Bayesian hierarchical multinomial logit models in PROC MCMC
The paper illustrates how to use the MCMC procedure to fit a hierarchical, multinomial logit model for a nominal response variable with correlated responses in a Bayesian framework. In particular, the paper illustrates how to perform three important parts of Bayesian model fitting. First, to make sure appropriate prior distributions are selected, the paper shows how to simulate draws directly f...
متن کاملA family of ordered logistic regression models fit by data expansion.
Sirs—One of us (SRC) coauthored a paper in the International Journal of Epidemiology that detailed a use of data expansion to fit the continuation-ratio model1 as sketched by Armstrong and Slone,2 as well as a paper in the Annals of Epidemiology that detailed a related use of data expansion to fit proportional-odds models.3 In fact, a family of ordered logistic regression models, including cont...
متن کاملComparing Multi-level and Ordinary Logistic Regression Models in Evaluating Factors Related to Periodontal Clinical Attachment Loss
Background and Objectives: Periodontal disease is one of the most common oral health problems. Clinical attachment loss occurs in sever periodontal cases (CAL>3). In this study, we applied a classic regression model and the models that consider the hierarchical structure of the data to estimate and compare the effect of different factors on CAL. Methods: This cross-sectional study was perfo...
متن کاملA SAS/IML Macro for Goodness-of-Fit Testing in Logistic Regression Models with Sparse Data
The logistic regression model has become the standard analyzing tool for binary responses in a variety of disciplines. Methods for assessing goodness-of-fit, however, are less developed and this is especially pronounced in calculating goodness-of-fit tests with sparse data, when the standard tests (deviance and Pearson test) behave unsatisfactorily. In our paper we show two solutions to the pro...
متن کامل