Testing Endogeneity with High Dimensional Covariates∗

نویسندگان

  • Zijian Guo
  • Hyunseung Kang
  • T. Tony Cai
  • Dylan S. Small
چکیده

Modern, high dimensional data has renewed investigation on instrumental variables (IV) analysis, primary focusing on estimation of the included endogenous variable under sparsity and little attention towards specification tests. This paper studies in high dimensions the Durbin-Wu-Hausman (DWH) test, a popular specification test for endogeneity in IV regression. We show, surprisingly, that the DWH test maintains its size in high dimensions, but at an expense in power. We propose a new test that remedies this issue and has better power than the DWH test. Simulation studies reveal that our test achieves near-oracle performance to detect endogeneity. JEL classification: C12; C36

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Testing Endogeneity with Possibly Invalid Instruments and High Dimensional Covariates

The Durbin-Wu-Hausman (DWH) test is a commonly used test for endogeneity in instrumental variables (IV) regression. Unfortunately, the DWH test depends, among other things, on assuming all the instruments are valid, a rarity in practice. In this paper, we show that the DWH test often has distorted size even if one IV is invalid. Also, the DWH test may have low power when many, possibly high dim...

متن کامل

Simultaneous equations models for discrete outcomes: coherence, completeness, and identication

This paper studies simultaneous equations models for two or more discrete outcomes. The models allow latent variables to be high dimensional and require latent variables and covariates to be independently distributed. The models may be incoherent, delivering no values of the outcomes at certain values of the latent variables and covariates. The models may be incomplete, delivering more than one...

متن کامل

Joint Testing and False Discovery Rate Control in High-Dimensional Multivariate Regression

Multivariate regression with high-dimensional covariates has many applications in genomic 15 and genetic research, in which some covariates are expected to be associated with multiple responses. This paper considers joint testing for regression coefficients over multiple responses and develops simultaneous testing methods with false discovery rate control. The test statistic is based on inverse...

متن کامل

Handling Correlations Between Covariates and Random Slopes in Multilevel Models

This article discusses estimation of multilevel/hierarchical linear models that include cluster-level random intercepts and random slopes. Viewing the models as structural, the random intercepts and slopes represent the effects of omitted cluster-level covariates that may be correlated with included covariates. The resulting correlations between random effects (intercepts and slopes) and includ...

متن کامل

Testing covariates in high-dimensional regression

Abstract In a high-dimensional linear regressionmodel, we propose a new procedure for testing statistical significance of a subset of regression coefficients. Specifically, we employ the partial covariances between the response variable and the tested covariates to obtain a test statistic. The resulting test is applicable even if the predictor dimension is much larger than the sample size. Unde...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017