A Pseudo Knockoff Filter for Correlated Features
نویسندگان
چکیده
In 2015, Barber and Candès introduced a new variable selection procedure called the knockoff filter to control the false discovery rate (FDR) and prove that this method achieves exact FDR control. Inspired by the work of Barber and Candès (2015), we propose and analyze a pseudoknockoff filter that inherits some advantages of the original knockoff filter and has more flexibility in constructing its knockoff matrix. Although we have not been able to obtain exact FDR control of the pseudo knockoff filter, we show that it satisfies an expectation inequality that offers some insight into FDR control. Moreover, we provide some partial analysis of the pseudo knockoff filter for the half Lasso and the least squares statistics. Our analysis indicates that the inverse of the covariance matrix of the feature matrix plays an important role in designing and analyzing the pseudo knockoff filter. Our preliminary numerical experiments show that the pseudo knockoff filter with the half Lasso statistic has FDR control. Moreover, our numerical experiments show that the pseudo-knockoff filter could offer more power than the original knockoff filter with the OMP or Lasso Path statistic when the features are correlated and non-sparse.
منابع مشابه
Some Analysis of the Knockoff Filter and its Variants
In many applications, we need to study a linear regression model that consists of a response variable and a large number of potential explanatory variables and determine which variables are truly associated with the response. In 2015, Barber and Candès introduced a new variable selection procedure called the knockoff filter to control the false discovery rate (FDR) and proved that this method a...
متن کاملThe knockoff filter for FDR control in group-sparse and multitask regression
We propose the group knockoff filter, a method for false discovery rate control in a linear regression setting where the features are grouped, and we would like to select a set of relevant groups which have a nonzero effect on the response. By considering the set of true and false discoveries at the group level, this method gains power relative to sparse regression methods. We also apply our me...
متن کاملA knockoff filter for high-dimensional selective inference
This paper develops a framework for testing for associations in a possibly high-dimensional linear model where the number of features/variables may far exceed the number of observational units. In this framework, the observations are split into two groups, where the first group is used to screen for a set of potentially relevant variables, whereas the second is used for inference over this redu...
متن کاملA Novel Structure for Realization of a Pseudo Two Path Band-Pass Filter
In this paper, a modified auto zeroed integrator is used to design and simulate a low-voltage high-Q switched capacitor pseudo 2-path filter. The filter is a sixth–order Chebyshev band-pass filter operating at sampling frequency of 1MHz and center frequency of 250 kHz with a quality factor of 50. The proposed filter has both low-voltage and high speed properties of the auto zeroed integrators ...
متن کاملn-fold Obstinate Filters in Pseudo-Hoop Algebras
In this paper, we introduce the concepts of n-fold obstinate pseudo-hoop and n-fold obstinate filter in pseudo-hoops. Then we investigated these notions and proved some properties of them. Also, we discussed the relationship between n-fold obstinate pseudo-hoop and n-fold obstinate filter and other types of n-fold pseudo-hoops and n-fold filters such as n-fold (positive) implicative filter and ...
متن کامل