Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
Abstract
BACKGROUND Logistic random effects models are a popular tool for analyzing multilevel (also called hierarchical) data with a binary or ordinal outcome. Here, we aim to compare different statistical software implementations of these models.

METHODS We used individual patient data from 8509 patients in 231 centers with moderate and severe Traumatic Brain Injury (TBI) enrolled in eight Randomized Controlled Trials (RCTs) and three observational studies. We fitted logistic random effects regression models with the 5-point Glasgow Outcome Scale (GOS) as outcome, both dichotomized and ordinal, with center and/or trial as random effects, and with age, motor score, pupil reactivity or trial as covariates. We then compared the implementations of frequentist and Bayesian methods to estimate the fixed and random effects. Frequentist approaches included R (lme4), Stata (GLLAMM), SAS (GLIMMIX and NLMIXED), MLwiN ([R]IGLS) and MIXOR; Bayesian approaches included WinBUGS, MLwiN (MCMC), the R package MCMCglmm and the experimental SAS procedure MCMC. Three data sets (the full data set and two sub-datasets) were analysed using two basic logistic random effects models, with either one random effect for the center or two random effects for center and trial. For the ordinal outcome in the full data set, a proportional odds model with a random center effect was also fitted.

RESULTS The packages gave similar parameter estimates for both the fixed and random effects, and for the binary (and ordinal) models, for the main study and whenever the number of level-1 (patient level) units was large relative to the number of level-2 (hospital level) units. However, for a relatively sparse data set, i.e. when the numbers of level-1 and level-2 units were about the same, the frequentist and Bayesian approaches showed somewhat different results. The software implementations differ considerably in flexibility, computation time, and usability. There are also differences in the availability of additional tools for model evaluation, such as diagnostic plots. The experimental SAS (version 9.2) procedure MCMC appeared to be inefficient.

CONCLUSIONS On relatively large data sets, the different software implementations of logistic random effects regression models produced similar results. Thus, for a large data set there seems to be no explicit preference for either a frequentist or a Bayesian approach (if based on vague priors), unless one has a preference from a philosophical point of view. The choice of a particular implementation may largely depend on the desired flexibility and the usability of the package. For small data sets the random effects variances are difficult to estimate. In the frequentist approaches the MLE of this variance was often estimated as zero, with a standard error that was either zero or could not be determined, while for Bayesian methods the estimates could depend on the chosen "non-informative" prior of the variance parameter. The starting value for the variance parameter may also be critical for the convergence of the Markov chain.
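For orientation, the models named in the abstract can be written out roughly as follows. This is only a sketch under assumed notation: the indices i (patient), j (center) and k (trial), the symbols for the coefficients and variances, the normality of the random effects, and the sign convention of the proportional odds model are not given in the abstract and are chosen here for illustration.

```latex
% Assumed notation: y_{ijk} is the dichotomized GOS of patient i in center j (trial k),
% x_{ijk} the covariate vector (age, motor score, pupil reactivity, ...).

% (1) Binary outcome, one random effect for center
\operatorname{logit} \Pr(y_{ij} = 1 \mid u_j) = \beta_0 + x_{ij}^{\top}\beta + u_j,
\qquad u_j \sim N(0, \sigma_u^2)

% (2) Binary outcome, two random effects for center and trial
\operatorname{logit} \Pr(y_{ijk} = 1 \mid u_j, v_k) = \beta_0 + x_{ijk}^{\top}\beta + u_j + v_k,
\qquad u_j \sim N(0, \sigma_u^2), \quad v_k \sim N(0, \sigma_v^2)

% (3) Ordinal 5-point GOS, proportional odds with a random center effect
\operatorname{logit} \Pr(y_{ij} \le c \mid u_j) = \alpha_c - \bigl(x_{ij}^{\top}\beta + u_j\bigr),
\qquad c = 1, \dots, 4, \quad \alpha_1 < \alpha_2 < \alpha_3 < \alpha_4
```

In this notation, the small-sample difficulty described in the conclusions concerns the variance parameters \sigma_u^2 and \sigma_v^2: a frequentist fit may place the estimate on the boundary at zero, whereas a Bayesian fit requires a prior on these same parameters.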
Similar resources
Ordinal Regression Analysis: Fitting the Proportional Odds Model Using Stata, SAS and SPSS
Researchers have a variety of options when choosing statistical software packages that can perform ordinal logistic regression analyses. However, statistical software, such as Stata, SAS, and SPSS, may use different techniques to estimate the parameters. The purpose of this article is to (1) illustrate the use of Stata, SAS and SPSS to fit proportional odds models using educational data; and (2...
Application of ordinal logistic regression models in quality of life studies
Background & Objectives: Due to the increasing tendency to measure the quality of life in recent years and the extensive quality of life questionnaires, it is important to determine the appropriate method of analyzing data derived from these studies. The aim of the present study was to introduce ordinal logistic regression models as an appropriate method for analyzing the data of quality of li...
Technical note: Analysis of ordinal outcomes from carcass data in beef cattle research.
Beef cattle research commonly uses Yield grade (YG) and Quality grade (QG) as outcomes in nutrition and health experiments. These outcomes, as commonly reported and analyzed, are ordinal variables with an assumed rank derived from an underlying latent variable that may or may not be available for analysis. The objective of this study was to employ mixed-effects ordinal regression and approaches...
Fitting Stereotype Logistic Regression Models for Ordinal Response Variables in Educational Research (Stata)
The stereotype logistic (SL) model is an alternative to the proportional odds (PO) model for ordinal response variables when the proportional odds assumption is violated. This model seems to be underutilized. One major reason is the constraint of current statistical software packages. Statistical Package for the Social Sciences (SPSS) cannot perform the SL regression analysis, and SAS does not ...
Multilevel Models for Ordinal and Nominal Variables
Reflecting the usefulness of multilevel analysis and the importance of categorical outcomes in many areas of research, generalization of multilevel models for categorical outcomes has been an active area of statistical research. For dichotomous response data, several approaches adopting either a logistic or probit regression model and various methods for incorporating and estimating the influen...