A Fast Parsimonious Maximum Likelihood Approach for Predicting Outcome Variables from a Large Number of Predictors
نویسنده
چکیده
A new model with K correlated components is presented for predicting outcome variables where the number of predictors G may exceed the total sample size N. A fast maximum likelihood algorithm provides closed-form expressions for the model parameters and statistical tests for determining the number of components. We also propose a way to reduce the number of predictors in a stepwise fashion, at each step eliminating the least important predictor based on a new measure of predictor importance. When at least one suppressor variable is included among the predictors, the new model predicts and validates better than traditional models, especially when G is large.
منابع مشابه
Review and evaluation of penalised regression methods for risk prediction in low‐dimensional data with few events
Risk prediction models are used to predict a clinical outcome for patients using a set of predictors. We focus on predicting low-dimensional binary outcomes typically arising in epidemiology, health services and public health research where logistic regression is commonly used. When the number of events is small compared with the number of regression coefficients, model overfitting can be a ser...
متن کاملDesigning a Prognostic Scoring System for Predicting the Outcomes of Proximal Fifth Metatarsal Fractures at 20 Weeks
Background: Fractures of the proximal fifth metatarsal bone are among the most common fractures observed in the foot and their classification and management has been subject to much discussion and disagreement. In this study, we aim to identify and quantify the effect of possible predictors of the outcome of the treatment of proximal fifth metatarsal fractures.Methods: Patients with established...
متن کاملImproving the Performance of Bayesian Estimation Methods in Estimations of Shift Point and Comparison with MLE Approach
A Bayesian analysis is used to detect a change-point in a sequence of independent random variables from exponential distributions. In This paper, we try to estimate change point which occurs in any sequence of independent exponential observations. The Bayes estimators are derived for change point, the rate of exponential distribution before shift and the rate of exponential distribution after s...
متن کاملLong-Term Prediction of Time Series using a Parsimonious Set of Inputs and LS-SVM
Time series prediction is an important problem in many areas of science and engineering. We investigate the use of a parsimonious set of autoregressive variables in the long-term prediction task using the direct prediction approach. We use a fast input selection algorithm on a large set of autoregressive variables for different direct predictors, and train nonlinear models (LS-SVM and a committ...
متن کاملA comparison of algorithms for maximum likelihood estimation of Spatial GLM models
In spatial generalized linear mixed models, spatial correlation is assumed by adding normal latent variables to the model. In these models because of the non-Gaussian spatial response and the presence of latent variables the likelihood function cannot usually be given in a closed form, thus the maximum likelihood approach is very challenging. The main purpose of this paper is to introduce two n...
متن کامل