Biased Bootstrap Methods for Reducing the Effects of Contamination
نویسندگان
چکیده
Contamination of a sampled distribution, for example by a heavy-tailed distribution, can degrade the performance of a statistical estimator. We suggest a general approach to alleviating this problem, using a version of the weighted bootstrap. The idea is to “tilt” away from the contaminated distribution by a given (but arbitrary) amount, in a direction that minimises a measure of the new distribution’s dispersion. This theoretical proposal has a simple empirical version, which results in each data value being assigned a weight according to an assessment of its influence on dispersion. Importantly, distance can be measured directly in terms of the likely level of contamination, without reference to an empirical measure of scale. This makes the procedure particularly attractive for use in multivariate problems. It has a number of forms, depending on the definitions taken for dispersion and for distance between distributions. Examples of dispersion measures include variance, and generalisations based on high-order moments. Practicable measures of the distance between distributions may be based on power divergence, which includes Hellinger and Kullback–Leibler distances. The resulting location estimator has a smooth, redescending influence curve, and appears to avoid computational difficulties typically associated with redescending estimators. Its breakdown point can be located at any desired value ǫ ∈ (0, 2) simply by “trimming” to a known distance (depending only on ǫ and the choice of distance measure) from the empirical distribution. The estimator has an affine-equivariant multivariate form. Further, the general method is applicable to a range of statistical problems, including regression. Centre for Mathematics and Its Applications, Australian National University, Canberra, ACT 0200, Australia. CSIRO Mathematical and Information Sciences, North Ryde, Sydney. Department of Statistics, University of Florida, Gainesville, FL 32611-8545, United States. Short title. Reducing contamination.
منابع مشابه
Comparing two testing procedures in unbalanced two-way ANOVA models under heteroscedasticity: Approximate degree of freedom and parametric bootstrap approach
The classic F-test is usually used for testing the effects of factors in homoscedastic two-way ANOVA models. However, the assumption of equal cell variances is usually violated in practice. In recent years, several test procedures have been proposed for testing the effects of factors. In this paper, the two methods that are approximate degree of freedom (ADF) and parametric bootstr...
متن کاملReducing Bias without Prejudicing Sign
Jackknife and bootstrap bias corrections are based on a diierencing argument which does not necessarily respect the sign of the true parameter value. Depending on sampling variability they can over-correct, producing a nal estimator that is negative when one knows on physical grounds that it should be positive. To overcome this problem we suggest a simple, alternative bootstrap approach, based ...
متن کاملThe Effects of China's Growth in Manufacturing Sector in the U.S. Economy
T his paper investigates the gain of bilateral trade between China and U.S. in manufacturing sectors when both countries play a role in asymmetric (biased) growth of international trade. Our model includes a special case of Biased Growth Theory in international trade. We collected labor productivity, export and import data by using classification of manufacturing industries, for U.S...
متن کاملMultilevel bootstrap analysis with assumptions violated
Resumen Background: Likelihood-based methods can work poorly when the residuals are not normally distributed and the variances across clusters are heterogeneous. Method: The performance of two estimation methods, the non-parametric residual bootstrap (RB) and the restricted maximum likelihood (REML) for fi tting multilevel models are compared through simulation studies in terms of bias, coverag...
متن کاملThe Biased-bootstrap for Gmm Models
In this talk, I present some theoretical and empirical properties of the uniform and biased-bootstrap for generalized method of moments (GMM) models. The version of the biased-bootstrap used in this paper is a form of weighted bootstrap with weights chosen to satisfy some constraints imposed by the model. A typical biased-bootstrap resample is obtained by resampling from a member within a pseud...
متن کامل