A Fast Parsimonious Maximum Likelihood Approach for Predicting Outcome Variables from a Large Number of Predictors

نویسنده

  • Jay Magidson
چکیده

A new model with K correlated components is presented for predicting outcome variables where the number of predictors G may exceed the total sample size N. A fast maximum likelihood algorithm provides closed-form expressions for the model parameters and statistical tests for determining the number of components. We also propose a way to reduce the number of predictors in a stepwise fashion, at each step eliminating the least important predictor based on a new measure of predictor importance. When at least one suppressor variable is included among the predictors, the new model predicts and validates better than traditional models, especially when G is large.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Review and evaluation of penalised regression methods for risk prediction in low‐dimensional data with few events

Risk prediction models are used to predict a clinical outcome for patients using a set of predictors. We focus on predicting low-dimensional binary outcomes typically arising in epidemiology, health services and public health research where logistic regression is commonly used. When the number of events is small compared with the number of regression coefficients, model overfitting can be a ser...

متن کامل

Designing a Prognostic Scoring System for Predicting the Outcomes of Proximal Fifth Metatarsal Fractures at 20 Weeks

Background: Fractures of the proximal fifth metatarsal bone are among the most common fractures observed in the foot and their classification and management has been subject to much discussion and disagreement. In this study, we aim to identify and quantify the effect of possible predictors of the outcome of the treatment of proximal fifth metatarsal fractures.Methods: Patients with established...

متن کامل

Improving the Performance of Bayesian Estimation Methods in Estimations of Shift Point and Comparison with MLE Approach

A Bayesian analysis is used to detect a change-point in a sequence of independent random variables from exponential distributions. In This paper, we try to estimate change point which occurs in any sequence of independent exponential observations. The Bayes estimators are derived for change point, the rate of exponential distribution before shift and the rate of exponential distribution after s...

متن کامل

Long-Term Prediction of Time Series using a Parsimonious Set of Inputs and LS-SVM

Time series prediction is an important problem in many areas of science and engineering. We investigate the use of a parsimonious set of autoregressive variables in the long-term prediction task using the direct prediction approach. We use a fast input selection algorithm on a large set of autoregressive variables for different direct predictors, and train nonlinear models (LS-SVM and a committ...

متن کامل

A comparison of algorithms for maximum likelihood estimation of Spatial GLM models

In spatial generalized linear mixed models, spatial correlation is assumed by adding normal latent variables to the model. In these models because of the non-Gaussian spatial response and the presence of latent variables the likelihood function cannot usually be given in a closed form, thus the maximum likelihood approach is very challenging. The main purpose of this paper is to introduce two n...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010