Kernel Quantile Estimators
نویسنده
چکیده
SUMMARY The estimation of population quantiles is of great interest when one is not prepared to assume a parametric form for the u.nderlying distribution. In addition, quantiles often arise as the natural thing to estimate when the underlying distribution is skewed. The sample quantile is a popular nonparametric estimator of the corresponding population quantile. Being a function of at most two order statistics, sample quantiles experience a substantial loss of efficiency for distributions such as the normal. An obvious way to improve efficiency is to form a weighted average of several order statistics, using an appropriate weight function. Such estimators are called L-estimators. The problem then becomes one of choosing the weight function. One class of L-estimators, which uses a density function (called a kernel) as its weight function, are called kernel quantile estimators. The effective performance of such estimators depends critically on the selection of a smoothing parameter. An important part of this paper is a theoretical analysis of this selection. In particular, we obtain an expression for the value of the smoothing parameter which minimizes asymptotic mean square error. Another key feature of this paper is that this expression is then used to develop a practical data-based method for smoothing parameter selection. Other L-estimators of quantiles have been proposed by Harrell and Davis estimator is just a bootstrap estimator (Section 1). An important aspect of this paper is that we show that asymptotically all of these are kernel estimators with a Gaussian kernel and we identify the bandwidths. It is seen that the choices of smoothing parameter inherent in both the Harrell and Davis estimator and the Brewer estimator are asymptotically suboptimal. Our theory also suggests a method for choosing a previously not understood tuning parameter in the Kaigh-Lachenbruch estimator. The final point is an investigation of how much reliance should be placed on the theoretical results, through a simulation study. We compare one of the kernel estimators, using data-based bandwidths, with the Harrell-Davis and Kaigh-Lachenbruch estimators. Over a variety of distributions little consistent difference is found between these estimators. An important conclusion, also made during the theoretical analysis, is that all of these estimators usually provide only modest improvement over the sample quantile. Our results indicate that even if one knew the best estimator for each situation one can expect an average improvement in efficiency of only 15%. Given the well-known distribution-free inference procedures (e.g., easily constructed …
منابع مشابه
On the Convolution Type Kernel Regression Estimator
This paper discusses modiications of the convolution type kernel regression estimator. One modiication uses kernel quantile estimators and is analyzed more detailed. This regression estimator combines advantages of local polynomial and kernel regression estimators and can be applied for small to large sample size. Its properties are illustrated by simulation results and asymptotic theory. Espec...
متن کاملNew Bandwidth Selection for Kernel Quantile Estimators
We propose a cross-validation method suitable for smoothing of kernel quantile estimators. In particular, our proposed method selects the bandwidth parameter, which is known to play a crucial role in kernel smoothing, based on unbiased estimation of a mean integrated squared error curve of which the minimising value determines an optimal bandwidth. This method is shown to lead to asymptotically...
متن کاملKernel and Nearest Neighbor Estimation of a Conditional Quantile
Let (Xl'Z1)' (X2,Z2),· .. ,(Xn,Zn) be LLd. as (X,Z), Z taking values in Rl, and for o < p < 1, let ep(x) denote the conditional p-quantile of Z given X=x, i.e., P(Z ~ ep(x) IX=x) = p. In this paper, kernel and nearest neighbor estimators of ep(x) are proposed. As a first step in studying the asymptotics of these estimates, Bahadur type ~ representations of the sample conditional quantile functi...
متن کاملA Direct Approach to Inference in Nonparametric and Semiparametric Quantile Regression Models
This paper makes two main contributions. First, we construct “density-free” confidence intervals and confidence bands for conditional quantiles in nonparametric and semiparametric quantile regression models. They are based on pairs of symmetrized k-NN quantile estimators at two appropriately chosen quantile levels. In contrast to Wald-type confidence intervals or bands based on the asymptotic d...
متن کاملKernel-smoothed Conditional Quantiles of Correlated Bivariate Discrete Data.
Socio-economic variables are often measured on a discrete scale or rounded to protect confidentiality. Nevertheless, when exploring the effect of a relevant covariate on the outcome distribution of a discrete response variable, virtually all common quantile regression methods require the distribution of the covariate to be continuous. This paper departs from this basic requirement by presenting...
متن کامل