Multi-Stage Variable Selection: Screen and Clean

نویسندگان

  • Larry Wasserman
  • Kathryn Roeder
چکیده

This paper explores the following question: what kind of statistical guarantees can be given when doing variable variable in high dimensional models? In particular, we look at the error rates and power of some multi-stage regression methods. In the first stage we fit a set of candidate models. In the second stage we select one model by cross-validation. In the third stage we use hypothesis testing to eliminate some variables. We refer to the first two stages as “screening” and the last stage as “cleaning.” We consider three screening methods: the lasso, marginal regression, and forward stepwise regression. Our method also gives consistent variable selection under weak conditions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling and Optimization of Industrial Multi-Stage Compressed Air System Using Actual Variable Effectiveness in Hot Regions

In this article, modeling and optimization of power consumption of two–stage compressed air system has been investigated. To do so, the two – stage compressed air cycle with intercooler of FAJR Petroleum Company was considered. This cycle includes two centrifugal compressors, a shell, and a tube intercooler. For modeling of power consumption, actual compressors isentropic efficiencies and inter...

متن کامل

Modeling and Optimization of Industrial Multi-Stage Compressed Air System Using Actual Variable Effectiveness in Hot Regions

In this article, modeling and optimization of power consumption of two–stage compressed air system has been investigated. To do so, the two – stage compressed air cycle with intercooler of FAJR Petroleum Company was considered. This cycle includes two centrifugal compressors, a shell, and a tube intercooler. For modeling of power consumption, actual compressors isentropic efficiencies and inter...

متن کامل

Optimality of graphlet screening in high dimensional variable selection

Consider a linear model Y = Xβ + σz, where X has n rows and p columns and z ∼ N(0, In). We assume both p and n are large, including the case of p n. The unknown signal vector β is assumed to be sparse in the sense that only a small fraction of its components is nonzero. The goal is to identify such nonzero coordinates (i.e., variable selection). We are primarily interested in the regime where s...

متن کامل

High Dimensional Variable Selection.

This paper explores the following question: what kind of statistical guarantees can be given when doing variable selection in high dimensional models? In particular, we look at the error rates and power of some multi-stage regression methods. In the first stage we fit a set of candidate models. In the second stage we select one model by cross-validation. In the third stage we use hypothesis tes...

متن کامل

A fuzzy random multi-objective approach for portfolio selection

In this paper, the portfolio selection problem is considered, where fuzziness and randomness appear simultaneously in optimization process. Since return and dividend play an important role in such problems, a new model is developed in a mixed environment by incorporating fuzzy random variable as multi-objective nonlinear model. Then a novel interactive approach is proposed to determine the pref...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007