Ranking Accuracy for Logistic-GEE Models
نویسندگان
چکیده
The logistic Generalized Estimating Equations (logisticGEE) models have been extensively used for analyzing clustered binary data. However, assessing the goodness-of-fit and predictability of these models is problematic due to the fact that no likelihood is available and the observations can be correlated within a cluster. In this paper we propose a new measure for estimating the generalization performance of the logistic GEE models, namely ranking accuracy for models based on clustered data (RAMCD). We define RAMCD as the probability that a randomly selected positive observation is ranked higher than randomly selected negative observation from another cluster. We propose a computationally efficient algorithm for RAMCD. The algorithm can be applied for two cases: (1) when we estimate RAMCD as a goodness-of-fit criterion and (2) when we estimate RAMCD as a predictability criterion. This is experimentally shown on clustered data from a simulation study and a biomarkers’ study.
منابع مشابه
Comparison of logistic regression and neural network models in predicting the outcome of biopsy in breast cancer from MRI findings
Background: We designed an algorithmic model based on the logistic regression analysis and a non-algorithmic model based on the Artificial Neural Network (ANN). Materials and methods: The ability of these models was compared together in clinical application to differentiate malignant from benign breast tumors in a study group of 161 patients' records. Each patient’s record consisted of 6 subjec...
متن کاملRanking stocks of listed companies on Tehran stock exchange using a hybrid model of decision tree and logistic regression
Much research has introduced linear or nonlinear models using statistical models and machine learning tools in artificial intelligence to estimate Iran's rate of return. The primary purpose of these methods is simultaneously use different independent variables to improve stock return rates' modeling. However, in predicting the rate of return, in addition to the modeling method, the degree of co...
متن کاملPresentation of new ensemble method of Bayesian and logistic regression models in landslide susceptibility assessment in the Khalkhal Township
The aim of current research is to assess of landslide susceptibility in the Khalkhal Township, southern Ardabil using an ensemble and new method namely Bayesian and logistic regression (BT-LR) models. At first, landslide inventory map was prepared and then effective factors on landslide occurrence were identified. These factors are slope degree, plan curvature, slope aspect, elevation, landuse,...
متن کاملPrediction of mental disorders after Mild Traumatic Brain Injury: principle component Approach
Introduction: In Processes Modeling, when there is relatively a high correlation between covariates, multicollinearity is created, and it leads to reduction in model's efficiency. In this study, by using principle component analysis, modification of the effect of multicolinearity in Artificial Neural Network (ANN) and Logistic Regression (LR) has been studied. Also, the effect of multicolineari...
متن کاملPrevalence proportion ratios: estimation and hypothesis testing.
BACKGROUND Recent communications have argued that often it may not be appropriate to analyse cross-sectional studies of prevalent outcomes with logistic regression models. The purpose of this communication is to compare three methods that have been proposed for application to cross sectional studies: (1) a multiplicative generalized linear model, which we will call the log-binomial model, (2) a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016