Weighted-bootstrap Alignment of Explanatory Variables
نویسندگان
چکیده
Adjustment for covariates is a time-honored tool in statistical analysis and is often implemented by including the covariates that one intends to adjust as additional predictors in a model. This adjustment often does not work well when the underlying model is misspecified. We consider here the situation where we compare a response between two groups. This response may depend on a covariate for which the distribution differs between the two groups one intends to compare. This creates the potential that observed differences are due to differences in covariate levels rather than “genuine” population differences that cannot be explained by covariate differences. We propose a bootstrap based adjustment method. Bootstrap weights are constructed with the aim of aligning bootstrap-weighted empirical distributions of the covariate between the two groups. Generally, the proposed weighted-bootstrap algorithm can be used to align or match the values of an explanatory variable as closely as desired to those of a given target distribution. We illustrate the proposed bootstrap adjustment method in simulations and in the analysis of data on the fecundity of historical cohorts of French-Canadian women.
منابع مشابه
Nonparametric tests for multi-parameter M-estimators
We consider likelihood ratio like test statistics based on M -estimators for multi-parameter hypotheses for some commonly used parametric models where the assumptions on which the standard test statistics are based are not justified. The nonparametric test statistics are based on empirical exponential families and permit us to give bootstrap methods for the tests. We further consider saddlepoin...
متن کاملA Consistent Nonparametric Bootstrap Test of Exogeneity
We propose a way of testing exogeneity of an explanatory variable without any parametric assumptions in the presence of a conditional "instrumental variable". A testable implication is derived that if an explanatory variable is exogenous, the conditional distribution of the outcome given explanatory variables is independent of the instrumental variable. We propose a consistent nonparametric boo...
متن کاملA WEIGHTED LINEAR REGRESSION MODEL FOR IMPERCISE RESPONSE
A weighted linear regression model with impercise response and p-real explanatory variables is analyzed. The LR fuzzy random variable is introduced and a metric is suggested for coping with this kind of variables. A least square solution for estimating the parameters of the model is derived. The result are illustrated by the means of some case studies.
متن کاملImproving Statistical Word Alignment with Ensemble Methods
This paper proposes an approach to improve statistical word alignment with ensemble methods. Two ensemble methods are investigated: bagging and cross-validation committees. On these two methods, both weighted voting and unweighted voting are compared under the word alignment task. In addition, we analyze the effect of different sizes of training sets on the bagging method. Experimental results ...
متن کاملOn properties of predictors derived with a two-step bootstrap model averaging approach - A simulation study in the linear regression model
In many applications of model selection there is a large number of explanatory variables and thus a large set of candidate models. Selecting one single model for further inference ignores model selection uncertainty. Often several models fit the data equally well. However, these models may differ in terms of the variables included and might lead to different predictions. To account for model se...
متن کامل