Model selection strategies for identifying most relevant covariates in homoscedastic linear models

نویسندگان

  • Aleksey Min
  • Hajo Holzmann
  • Claudia Czado
چکیده

We propose a new method in two variations for the identification of most relevant covariates in linear models with homoscedastic errors. In contrast to AIC, BIC and other information criteria, our method is based on an interpretable scaled quantity. This quantity measures a maximal relative error one makes by selecting covariates from a given set of all available covariates. The proposed model selection procedures rely on asymptotic normality of test statistics, and therefore normality of the errors in the regression model is not required. In a simulation study the performance of the suggested methods along with the performance of the standard model selection criteria AIC and BIC is examined. The simulation study illustrates the evident superiority of the proposed method over the AIC and the BIC, and especially when regression effects possess influence of several orders 1 in magnitude. The accuracy of the normal approximation to the test statistics is also investigated. The normal approximation is already satisfactory for sample sizes 50 and 100. As an illustration we analyze US college spending data from 1994.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Panning for Gold: Model-X Knockoffs for High-dimensional Controlled Variable Selection

Many contemporary large-scale applications involve building interpretable models linking a large set of potential covariates to a response in a nonlinear fashion, such as when the response is binary. Although this modeling problem has been extensively studied, it remains unclear how to effectively control the fraction of false discoveries even in high-dimensional logistic regression, not to men...

متن کامل

Differenced-Based Double Shrinking in Partial Linear Models

Partial linear model is very flexible when the relation between the covariates and responses, either parametric and nonparametric. However, estimation of the regression coefficients is challenging since one must also estimate the nonparametric component simultaneously. As a remedy, the differencing approach, to eliminate the nonparametric component and estimate the regression coefficients, can ...

متن کامل

Efficient estimation and model selection for single-index varying-coefficient models

The single-index varying-coefficient models include many types of popular semiparametric models, i.e. single-index models, partially linear models, varying-coefficient models, and so on. In this paper, we first establish the semiparametric efficiency bound for the single-index varying-coefficient model, and develop an estimation method based on the efficient estimating equations. Although our m...

متن کامل

PROVIDING A MODEL FOR THE SUPPLIER SELECTION PROCESS IN THE SUPPLY CHAIN MANAGEMENT WITH HYBRID MODEL OF DECISION MAKING

<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...

متن کامل

PROVIDING A MODEL FOR THE SUPPLIER SELECTION PROCESS IN THE SUPPLY CHAIN MANAGEMENT WITH HYBRID MODEL OF DECISION MAKING

<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 54  شماره 

صفحات  -

تاریخ انتشار 2010