Efficiencies and Surrogate Variables in Logistic Regression
نویسنده
چکیده
Raymond J. Carroll Department of Statistics Texas A&M University College Station, TX 77843 We study logistic regression with response y when the true predictor x is measured with· error and the observable data consist of pairs (y,w), where w is correlated with x. Two approaches to estimation are studied. In the first, integrated likelihood estimates are obtained from the conditional distribution of y given E(x Iw). In the second approach, the calibration curve for (x, w) is used to construct unbiased estimates of x and then errors in variables estimation techniques are employed. When (x, w) has a bivariate normal distribution, the two approaches yield consistent and asymptotically normally distributed parameter estimates. The intergrated likelihood estimates are more efficient here, although the errors in variables estimates are shown to be nearly efficient for many choices of the parameters. Our results also apply to the problem of assessing the loss of efficiency in semiparametric regression models, where in this context it is shown that there is often little potential gain for specifying the distribution of x completely.
منابع مشابه
بهکارگیری متغیرهای پنهان در مدل رگرسیون لجستیک برای حذف اثر همخطی چندگانه در تحلیل برخی عوامل مرتبط با سرطان پستان
Background and Objectives: Logistic regression is one of the most widely used generalized linear models for analysis of the relationships between one or more explanatory variables and a categorical response. Strong correlations among explanatory variables (multicollinearity) reduce the efficiency of model to a considerable degree. In this study we used latent variables to reduce the effects of ...
متن کاملModeling Past Vegetation Change Through Remote Sensing and G.I.S: A Comparison of Neural Networks and Logistic Regression Methods
Past change in remnant vegetation patches was modeled using remotely-sensed MSS and TM imagery and G.I.S. The images covering 27 years from 1973 to 2000 were used in detecting change in vegetation through post-classification comparison and modeling it through neural networks and logistic regression methods. Physical factors, image-based layers and landscape metrics provided the 19 predictor var...
متن کاملComparison of Random Forest and Logistic Regression Methods in Predicting Mortality in Colorectal Cancer Patients and its Related Factors
Background and Objectives: The purpose of this study was to predict the mortality rate of colorectal cancer in Iranian patients and determine the effective factors on the mortality of patients with colorectal cancer using random forest and logistic regression methods. Methods: Data from 304 patients with colorectal cancer registry from the Gastroenterology and Liver Research Center of Shah...
متن کاملFactors Influencing Drug Injection History among Prisoners: A Comparison between Classification and Regression Trees and Logistic Regression Analysis
Background: Due to the importance of medical studies, researchers of this field should be familiar with various types of statistical analyses to select the most appropriate method based on the characteristics of their data sets. Classification and regression trees (CARTs) can be as complementary to regression models. We compared the performance of a logistic regression model and a CART in predi...
متن کاملOnm-4: Reproductive Factors and Breast Cancer Incidence in Premenopousal Patients: A Case-Control Study
Background: Menstrual characteristics may serve as surrogate measures of endogenous estrogen and may be related to breast cancer risk. This case-control study investigated the relationship between menstrual factors and risk of breast cancer incidence in Iranian Center for Breast Cancer (ICBC). Materials and Methods: Forty seven Cases of breast cancer referred to ICBC and 77 controls were studie...
متن کامل