A new test for sphericity of the covariance matrix for high dimensional data
نویسندگان
چکیده
AMS subject classifications: 62H10 62H15 Keywords: Covariance matrix Hypothesis testing High-dimensional data analysis a b s t r a c t In this paper we propose a new test procedure for sphericity of the covariance matrix when the dimensionality, p, exceeds that of the sample size, N = n + 1. Under the assumptions that (A) 0 < trΣ the concentration, a new statistic is developed utilizing the ratio of the fourth and second arithmetic means of the eigenvalues of the sample covariance matrix. The newly defined test has many desirable general asymptotic properties, such as normality and consistency when (n, p) → ∞. Our simulation results show that the new test is comparable to, and in some cases more powerful than, the tests for sphericity in the current literature. Many applications of modern multivariate statistics involve a large number of variables, p, and hence a large covariance matrix. In many situations (e.g. DNA microarray data) the dimensionality exceeds that of the number of observations, N = n +1. In this article, we discuss much of the previous work in developing statistics for testing if the covariance matrix is proportional to the identity, more commonly called Sphericity. We consider X 1 ,. .. , X N as a set of independent observations from a multivariate normal distribution N p (µ, Σ), where both the mean vector µ ∈ R p and covariance matrix Σ > 0 are unknown. We are interested in testing H 0 : Σ = σ 2 I vs. H A : Σ = σ 2 I, where σ 2 is the unknown scalar proportion. The classical hypothesis testing techniques are based on the likelihood ratio and are degenerate when p > n. Motivated by the previous work in the literature, we define a new test statistic under the framework known as general asymptotics or (n, p)-asymptotics. Much of the current work rests on the large body of literature regarding asymptotics for eigenvalues of the sample Yin and Krishnaiah [24] and others. We build on the substantial list of work completed on statistical testing involving large random matrices, such as Bai et al. [4], Saranadasa [13] and most recently the work completed by Ledoit and Wolf [11], Srivastava [21–23] and Schott [14–16]. Ledoit and Wolf [11] show the locally best invariant test based on John's U statistic, see [10], to be (n, p)-consistent when p/n → c …
منابع مشابه
Some Hypothesis Tests for the Covariance Matrix When the Dimension Is Large Compared to the Sample Size by Olivier Ledoit
This paper analyzes whether standard covariance matrix tests work when dimensionality is large, and in particular larger than sample size. In the latter case, the singularity of the sample covariance matrix makes likelihood ratio tests degenerate, but other tests based on quadratic forms of sample covariance matrix eigenvalues remain well-defined. We study the consistency property and limiting ...
متن کاملError bounds for high–dimensional Edgeworth expansions for some tests on covariance matrices
Problems of testing three hypotheses : (i) equality of covariance matrices of several multivariate normal populations, (ii) sphericity, and (iii) that a covariance matrix is equal to a specified one, are treated. High–dimensional Edgeworth expansions of the null distributions of the modified likelihood ratio test statistics are derived. Computable error bounds of the expansions are derived for ...
متن کاملAhmad Tests for Covariance Matrices , particularly for High - dimensional Data
Test statistics for sphericity and identity of the covariance matrix are presented, when the data are multivariate normal and the dimension, p, can exceed the sample size, n. The test statistics are shown to follow an approximate normal distribution for large p, also when p >> n. The statistics are derived under very general conditions, particularly avoiding any strict assumptions on the traces...
متن کاملLikelihood Ratio Test for the Hyper- Block Matrix Sphericity Covariance Structure – Characterization of the Exact Distribution and Development of Near-exact Distributions for the Test Statistic
• In this paper the authors introduce the hyper-block matrix sphericity test which is a generalization of both the block-matrix and the block-scalar sphericity tests and as such also of the common sphericity test. This test is a tool of crucial importance to verify elaborate assumptions on covariance matrix structures, namely on metaanalysis and error covariance structures in mixed models and m...
متن کاملMore powerful tests for sparse high-dimensional covariances matrices
This paper considers improving the power of tests for the identity and sphericity hypotheses regarding high dimensional covariance matrices. The power improvement is achieved by employing the banding estimator for the covariance matrices, which leads to significant reduction in the variance of the test statistics in high dimension. Theoretical justification and simulation experiments are provid...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Multivariate Analysis
دوره 101 شماره
صفحات -
تاریخ انتشار 2010