Hypothesis Test for Normal Mixture Models: The EM Approach
Abstract
Normal mixture distributions are arguably the most important mixture models, and also the most technically challenging. The likelihood function of the normal mixture model is unbounded based on a set of random samples, unless an artificial bound is placed on its component variance parameter. Moreover, the model is not strongly identifiable, so it is hard to differentiate between overdispersion caused by the presence of a mixture and that caused by a large variance, and it has infinite Fisher information with respect to mixing proportions. There has been extensive research on finite normal mixture models, but much of it addresses merely consistency of the point estimation or useful practical procedures, and many results require undesirable restrictions on the parameter space. We show that an EM-test for homogeneity is effective at overcoming many challenges in the context of finite normal mixtures. We find that the limiting distribution of the EM-test is a simple function of the $0.5\chi^2_0 + 0.5\chi^2_1$ and $\chi^2_1$ distributions when the mixing variances are equal but unknown, and the $\chi^2_2$ when variances are unequal and unknown. Simulations show that the limiting distributions approximate the finite sample distribution satisfactorily. Two genetic examples are used to illustrate the application of the EM-test.
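The unboundedness of the likelihood mentioned in the abstract can be seen numerically. The following is a minimal sketch, not the EM-test itself: it assumes a two-component normal mixture, a small made-up data set, and hypothetical helper names (`normal_pdf`, `mixture_loglik`). One component is centered exactly on an observation while its standard deviation shrinks toward zero.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of N(mean, sd^2) at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def mixture_loglik(data, weight, mean1, sd1, mean2, sd2):
    """Log-likelihood of weight*N(mean1, sd1^2) + (1-weight)*N(mean2, sd2^2)."""
    return sum(
        math.log(
            weight * normal_pdf(x, mean1, sd1)
            + (1.0 - weight) * normal_pdf(x, mean2, sd2)
        )
        for x in data
    )


# Made-up observations, for illustration only.
data = [-1.3, -0.4, 0.1, 0.7, 1.9]

# Component 1 sits exactly on the first observation; its sd shrinks.
# Component 2 stays fixed at N(0, 1) so every term remains positive.
sds = [1.0, 1e-2, 1e-4, 1e-8]
logliks = [mixture_loglik(data, 0.5, data[0], sd, 0.0, 1.0) for sd in sds]

for sd, ll in zip(sds, logliks):
    print(f"sd1 = {sd:g}: log-likelihood = {ll:.3f}")
```

In this sketch the log-likelihood keeps increasing as the first component's standard deviation shrinks, which is why an unrestricted maximum likelihood estimate does not exist without some bound or penalty on the component variances.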
Similar resources
MIXFIT: an algorithm for the automatic fitting and testing of normal mixture models
We consider the fitting of normal mixture models to multivariate data, using maximum likelihood via the EM algorithm. This approach requires the specification of an initial estimate of the vector of unknown parameters, or equivalently, of an initial classification of the data with respect to the components of the mixture model under fit. We describe an algorithm called MIXFIT that automatically ...
The Family of Scale-Mixture of Skew-Normal Distributions and Its Application in Bayesian Nonlinear Regression Models
In previous studies on fitting non-linear regression models with a symmetric structure, normality is usually assumed in the analysis of the data. This choice may be inappropriate when the distribution of the residual terms is asymmetric. Recently, the family of scale-mixture of skew-normal distributions has become the main concern of many researchers. This family includes several skewed and heavy-tailed d...
Calculation of Physical Properties of the Methanol-Water Mixture Using Molecular Dynamics Simulation
In this study, some properties of the methanol-water mixture such as diffusivity, density, viscosity, and hydrogen bonding were calculated at different temperatures and ...
On Some Variants of the EM Algorithm for the Fitting of Finite Mixture Models
Finite mixture models are being increasingly used in statistical inference and to provide a model-based approach to cluster analysis. Mixture models can be fitted to independent data in a straightforward manner via the expectation-maximization (EM) algorithm. In this paper, we look at ways of speeding up the fitting of normal mixture models by using variants of the EM, including the so-called s...