Biases and Variances of Survey Estimators Based on Nearest Neighbor Imputation

نویسندگان

  • Jiahua Chen
  • Jun Shao
چکیده

NEAREST NEIGHBOR IMPUTATION Jiahua Chen1 University of Waterloo Jun Shao2 University of Wisconsin-Madison Abstract Nearest neighbor imputation is one of the hot deck methods used to compensate for nonresponse in sample surveys. Although it has a long history of application, theoretical properties of the nearest neighbor imputation method are unknown prior to the current paper. We show that under some conditions, the nearest neighbor imputation method provides asymptotically unbiased and consistent estimators of functions of population means (or totals), population distributions, and population quantiles. We also derive the asymptotic variances for estimators based on nearest neighbor imputation and consistent estimators of these asymptotic variances. Some simulation results show that the estimators based on nearest neighbor imputation and the proposed variance estimators have good performances.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Empirical Comparison of Performance of the Unified Approach to Linearization of Variance Estimation after Imputation with Some Other Methods

Imputation is one of the most common methods to reduce item non_response effects. Imputation results in a complete data set, and then it is possible to use naϊve estimators. After using most of common imputation methods, mean and total (imputation estimators) are still unbiased. However their variances (imputation variances) are underestimated by naϊve variance estimators. Sampling mechanism an...

متن کامل

Confidence Intervals Based On Survey Data With Nearest Neighbor Imputation

Nearest neighbor imputation (NNI) is a popular imputation method used to compensate for item nonresponse in sample surveys. Although previous results showed that the NNI sample mean and quantiles are consistent estimators of the population mean and quantiles, large sample inference procedures, such as asymptotic confidence intervals for the population mean and quantiles, are not available. For ...

متن کامل

Estimation of Density using Plotless Density Estimator Criteria in Arasbaran Forest

    Sampling methods have a theoretical basis and should be operational in different forests; therefore selecting an appropriate sampling method is effective for accurate estimation of forest characteristics. The purpose of this study was to estimate the stand density (number per hectare) in Arasbaran forest using a variety of the plotless density estimators of the nearest neighbors sampling me...

متن کامل

Asymptotic Behaviors of Nearest Neighbor Kernel Density Estimator in Left-truncated Data

Kernel density estimators are the basic tools for density estimation in non-parametric statistics.  The k-nearest neighbor kernel estimators represent a special form of kernel density estimators, in  which  the  bandwidth  is varied depending on the location of the sample points. In this paper‎, we  initially introduce the k-nearest neighbor kernel density estimator in the random left-truncatio...

متن کامل

Variance Estimation for Finite Populations with Imputed Data

One way of handling survey nonresponse is to impute data for each nonrespondent. When estimating sampling variances, however, treating the imputed data as a complete set frequently leads to underestimates of the true sampling variance. Techniques have been recently developed to yield valid variance estimates in the presence of imputed data for some estimators and sample designs. Economic survey...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007