Multi-Stage Variable Selection: Screen and Clean
نویسندگان
چکیده
This paper explores the following question: what kind of statistical guarantees can be given when doing variable variable in high dimensional models? In particular, we look at the error rates and power of some multi-stage regression methods. In the first stage we fit a set of candidate models. In the second stage we select one model by cross-validation. In the third stage we use hypothesis testing to eliminate some variables. We refer to the first two stages as “screening” and the last stage as “cleaning.” We consider three screening methods: the lasso, marginal regression, and forward stepwise regression. Our method also gives consistent variable selection under weak conditions.
منابع مشابه
Modeling and Optimization of Industrial Multi-Stage Compressed Air System Using Actual Variable Effectiveness in Hot Regions
In this article, modeling and optimization of power consumption of two–stage compressed air system has been investigated. To do so, the two – stage compressed air cycle with intercooler of FAJR Petroleum Company was considered. This cycle includes two centrifugal compressors, a shell, and a tube intercooler. For modeling of power consumption, actual compressors isentropic efficiencies and inter...
متن کاملModeling and Optimization of Industrial Multi-Stage Compressed Air System Using Actual Variable Effectiveness in Hot Regions
In this article, modeling and optimization of power consumption of two–stage compressed air system has been investigated. To do so, the two – stage compressed air cycle with intercooler of FAJR Petroleum Company was considered. This cycle includes two centrifugal compressors, a shell, and a tube intercooler. For modeling of power consumption, actual compressors isentropic efficiencies and inter...
متن کاملOptimality of graphlet screening in high dimensional variable selection
Consider a linear model Y = Xβ + σz, where X has n rows and p columns and z ∼ N(0, In). We assume both p and n are large, including the case of p n. The unknown signal vector β is assumed to be sparse in the sense that only a small fraction of its components is nonzero. The goal is to identify such nonzero coordinates (i.e., variable selection). We are primarily interested in the regime where s...
متن کاملHigh Dimensional Variable Selection.
This paper explores the following question: what kind of statistical guarantees can be given when doing variable selection in high dimensional models? In particular, we look at the error rates and power of some multi-stage regression methods. In the first stage we fit a set of candidate models. In the second stage we select one model by cross-validation. In the third stage we use hypothesis tes...
متن کاملA fuzzy random multi-objective approach for portfolio selection
In this paper, the portfolio selection problem is considered, where fuzziness and randomness appear simultaneously in optimization process. Since return and dividend play an important role in such problems, a new model is developed in a mixed environment by incorporating fuzzy random variable as multi-objective nonlinear model. Then a novel interactive approach is proposed to determine the pref...
متن کامل