Testing equality of covariance matrices when data are incomplete
Abstract
In the statistics literature, a number of procedures have been proposed for testing equality of several groups' covariance matrices when data are complete, but this problem has not been considered for incomplete data in a general setting. This paper proposes statistical tests for equality of covariance matrices when data are missing. A Wald test (denoted by T1) and a likelihood ratio test (LRT) (denoted by R), both based on the assumption of normal populations, are developed. It is well known that in the complete-data case the classic LRT and the Wald test constructed under the normality assumption perform poorly when data are not from multivariate normal distributions. As expected, this is also true in the incomplete-data case, which has led us to construct a robust Wald test (denoted by T2) that performs well for both normal and non-normal data. A re-scaled LRT (denoted by R∗) is also proposed. A simulation study is carried out to assess the performance of T1, T2, R, and R∗ in terms of the closeness of their observed significance levels to the nominal significance level, as well as their power. It is found that T2 performs very well for both normal and non-normal data in both small and large samples. In addition to their usual applications, we discuss the application of the proposed tests to testing whether a set of data are missing completely at random (MCAR). © 2006 Elsevier B.V. All rights reserved.
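For orientation, the classic complete-data LRT that the abstract refers to can be sketched as follows. This is a minimal illustration of the textbook Box's M form (the modified LRT with Box's chi-square correction), assuming fully observed data. It is not the paper's T1, T2, R, or R∗, which are designed for incomplete data, and the function name `box_m_statistic` is hypothetical.

```python
import numpy as np

def box_m_statistic(groups):
    """Box's M test statistic for equality of covariance matrices.

    groups: list of (n_i, p) arrays with NO missing values
    (complete-data case only; an illustrative assumption).
    Returns the corrected statistic and its chi-square degrees of freedom.
    """
    g = len(groups)
    p = groups[0].shape[1]
    dof = np.array([x.shape[0] - 1 for x in groups])

    # Unbiased sample covariance matrix of each group.
    covs = [np.atleast_2d(np.cov(x, rowvar=False)) for x in groups]
    # Pooled covariance, weighted by each group's degrees of freedom.
    pooled = sum(d * s for d, s in zip(dof, covs)) / dof.sum()

    def logdet(a):
        # Log-determinant computed stably via slogdet.
        return np.linalg.slogdet(a)[1]

    m = dof.sum() * logdet(pooled) - sum(d * logdet(s) for d, s in zip(dof, covs))

    # Box's small-sample correction factor.
    c = (2 * p**2 + 3 * p - 1) / (6 * (p + 1) * (g - 1)) * (
        np.sum(1.0 / dof) - 1.0 / dof.sum()
    )
    stat = (1 - c) * m
    df = (g - 1) * p * (p + 1) // 2
    return stat, df
```

Under normality and the null hypothesis, the returned statistic is approximately chi-square distributed with `df` degrees of freedom. As the abstract notes, tests of this kind are sensitive to departures from normality, which is the motivation for the robust alternative.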
Similar resources
On Selecting Tests for Equality of Two Normal Mean Vectors.
The conventional approach for testing the equality of two normal mean vectors is to test first the equality of covariance matrices, and if the equality assumption is tenable, then use the two-sample Hotelling T² test. Otherwise one can use one of the approximate tests for the multivariate Behrens-Fisher problem. In this article, we study the properties of the Hotelling T² test, the conven...
Resampling-based methods in single and multiple testing for equality of covariance/correlation matrices.
Traditional resampling-based tests for homogeneity in covariance matrices across multiple groups resample residuals, that is, data centered by group means. These residuals do not share the same second moments when the null hypothesis is false, which makes them difficult to use in the setting of multiple testing. An alternative approach is to resample standardized residuals, data centered by gro...
Tests of some hypotheses on characteristic roots of covariance matrices not requiring normality assumptions
Test statistics for testing some hypotheses on characteristic roots of covariance matrices are presented, their asymptotic distribution is derived and a confidence interval for the proportional sum of the characteristic roots is constructed. The resulting procedures are robust against violation of the normality assumptions in the sense that they asymptotically possess chosen significance level ...
Error bounds for high–dimensional Edgeworth expansions for some tests on covariance matrices
Problems of testing three hypotheses: (i) equality of covariance matrices of several multivariate normal populations, (ii) sphericity, and (iii) that a covariance matrix is equal to a specified one, are treated. High–dimensional Edgeworth expansions of the null distributions of the modified likelihood ratio test statistics are derived. Computable error bounds of the expansions are derived for ...
Computation of a Test Statistic in Data Quality Control
When processing observational data, statistical testing is an essential instrument to hopefully render harmless incidental anomalies and disturbances in the measurements. A commonly used test statistic based on the general linear model is the generalized likelihood ratio test statistic. The standard formula given in the literature for this test statistic is not defined if the noise covariance m...
Journal title:
- Computational Statistics & Data Analysis
Volume 51, issue
Pages -
Publication date: 2007