Robust Mixture Modeling with Pearson Type VII Distribution

نویسنده

  • Jianyong Sun
چکیده

Mixture of Student t-distribution (MoT) has been widely used to model multivariate data sets with atypical observations, or outliers for robust clustering. In this paper, we developed a novel robust clustering approach by modeling the data sets with mixture of Pearson type VII distribution (MoP). An EM algorithm is developed for the maximum likelihood estimation of the model parameters. Outlier detection criterion is derived from the EM solution. Controlled experimental results on synthetic datasets show that the MoP performs comparably, on average, with the MoT in terms of outlier detection accuracy and out-of-sample log-likelihood, but the MoP is more stable. Furthermore, we compared the performances of the Pearson type VII and the student t mixtures on the classification of several benchmark pattern recognition data sets. The comparison favors the developed Pearson type VII mixtures.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust mixture clustering using Pearson type VII distribution

A mixture of Student t-distributions (MoT) has been widely used to model multivariate data sets with atypical observations, or outliers for robust clustering. In this paper, we developed a novel robust clustering approach bymodeling the data sets usingmixture of Pearson type VII distributions (MoP). An EM algorithm is developed for the maximum likelihood estimation of the model parameters. An o...

متن کامل

Evaluation and Application of the Gaussian-Log Gaussian Spatial Model for Robust Bayesian Prediction of Tehran Air Pollution Data

Air pollution is one of the major problems of Tehran metropolis. Regarding the fact that Tehran is surrounded by Alborz Mountains from three sides, the pollution due to the cars traffic and other polluting means causes the pollutants to be trapped in the city and have no exit without appropriate wind guff. Carbon monoxide (CO) is one of the most important sources of pollution in Tehran air. The...

متن کامل

Generalized Birnbaum-Saunders Distribution

The two-parameter Birnbaum–Saunders (BS) distribution was originally proposed as a failure time distribution for fatigue failure caused under cyclic loading. BS model is a positively skewed statistical distribution which has received great attention in recent decades. Several extensions of this distribution with various degrees of skewness, kurtosis and modality are considered. In particular, a...

متن کامل

Matrix Kummer-Pearson VII Relation and Polynomial Pearson VII Configuration Density

Abstract. A case of the matrix Kummer relation of Herz (1955) based on the Pearson VII type matrix model is derived in this paper. As a con- sequence, the polynomial Pearson VII configuration density is obtained and this sets the corresponding exact inference as a solvable aspect in shape theory. An application in postcode recognition, including a nu- merical comparison between the exact poly...

متن کامل

Estimating Velocity for Processive Motor Proteins with Random Detachment.

We show that, for a wide range of models, the empirical velocity of processive motor proteins has a limiting Pearson type VII distribution with finite mean but infinite variance. We develop maximum likelihood inference for this Pearson type VII distribution. In two simulation studies, we compare the performance of our MLE with the performance of standard Student's t-based inference. The studies...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010