Model-based Methods of Classification: Using the mclust Software in Chemometrics

Authors

  • Chris Fraley
  • Adrian E. Raftery
Abstract

Due to recent advances in methods and software for model-based clustering, and to the interpretability of the results, clustering procedures based on probability models are increasingly preferred over heuristic methods. The clustering process estimates a model for the data that allows for overlapping clusters, producing a probabilistic clustering that quantifies the uncertainty of observations belonging to components of the mixture. The resulting clustering model can also be used for some other important problems in multivariate analysis, including density estimation and discriminant analysis. Examples of the use of model-based clustering and classification techniques in chemometric studies include multivariate image analysis, magnetic resonance imaging, microarray image segmentation, statistical process control, and food authenticity. We review model-based clustering and related methods for density estimation and discriminant analysis, and show how the R package mclust can be applied in each instance.
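
The idea of a probabilistic clustering with per-observation uncertainty can be illustrated with a minimal sketch. It is written in plain Python rather than with mclust itself, and it assumes a one-dimensional mixture of two Gaussians with a shared variance, fitted by EM. The function names, the toy data, and the crude initialisation are all hypothetical choices made for illustration; uncertainty is taken here to be one minus the largest posterior membership probability.

```python
import math


def normal_pdf(x, mean, var):
    """Density of a univariate normal distribution at x."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def fit_two_component_mixture(data, n_iter=200):
    """EM for a 1-D mixture of two Gaussians with a shared variance.

    Illustrative sketch only: fixed iteration count, no convergence check.
    """
    # Crude initialisation (an assumption of this sketch): split the sorted data in half.
    ordered = sorted(data)
    half = len(ordered) // 2
    groups = [ordered[:half], ordered[half:]]
    weights = [0.5, 0.5]
    means = [sum(g) / len(g) for g in groups]
    variance = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g) / len(data)
    variance = max(variance, 1e-6)
    resp = []
    for _ in range(n_iter):
        # E-step: posterior probability that each observation belongs to each component.
        resp = []
        for x in data:
            dens = [w * normal_pdf(x, m, variance) for w, m in zip(weights, means)]
            total = sum(dens)
            resp.append([d / total for d in dens])
        # M-step: update mixing weights, component means, and the shared variance.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            weights[k] = nk / len(data)
            means[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
        variance = sum(
            r[k] * (x - means[k]) ** 2 for r, x in zip(resp, data) for k in range(2)
        ) / len(data)
        variance = max(variance, 1e-6)
    return weights, means, variance, resp


# Toy data: two groups near 1 and 5, plus one ambiguous observation at 3.0.
data = [0.7, 0.8, 0.9, 1.0, 1.1, 1.2, 1.3, 3.0, 4.7, 4.8, 4.9, 5.0, 5.1, 5.2, 5.3]
weights, means, variance, resp = fit_two_component_mixture(data)

# Uncertainty of each observation: one minus its largest membership probability.
uncertainty = [1.0 - max(r) for r in resp]

for x, r, u in zip(data, resp, uncertainty):
    print(f"x={x:4.1f}  P(component 1)={r[0]:.3f}  uncertainty={u:.3f}")
```

In this toy example the observations near 1 and 5 are assigned to their components with negligible uncertainty, while the observation at 3.0, which lies between the two groups, receives a much larger one. A heuristic hard-assignment method would not report this distinction.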

Similar resources

Enhanced Model-Based Clustering, Density Estimation, and Discriminant Analysis Software: MCLUST

Abstract: MCLUST is a software package for model-based clustering, density estimation and discriminant analysis interfaced to the S-PLUS commercial software and the R language. It implements parameterized Gaussian hierarchical clustering algorithms and the EM algorithm for parameterized Gaussian mixture models with the possible addition of a Poisson noise term. Also included are functions that ...

Characterization and Classification of Iranian Honey Based on Physicochemical Properties and Antioxidant Activities, with Chemometrics Approach

In the present study, the physicochemical properties and antioxidant activities of different Iranian honey samples are investigated using various multivariate techniques in order to develop a quality control model. Forty-eight Iranian honey samples were tested for 15 physicochemical and antioxidant parameters. The parameters for which the samples were tested included color intensity, moisture, ...

Chemometrics-enhanced Classification of Source Rock Samples Using their Bulk Geochemical Data: Southern Persian Gulf Basin

Chemometric methods can enhance geochemical interpretations, especially when working with large datasets. With this aim, exploratory hierarchical cluster analysis (HCA) and principal component analysis (PCA) methods are used herein to study the bulk pyrolysis parameters of 534 samples from the Persian Gulf basin. These methods are powerful techniques for identifying the patterns of variations i...

mclust Version 4 for R: Normal Mixture Modeling for Model-Based Clustering, Classification, and Density Estimation

mclust is a contributed R package for model-based clustering, classification, and density estimation based on finite normal mixture modeling. It provides functions for parameter estimation via the EM algorithm for normal mixture models with a variety of covariance structures, and functions for simulation from these models. Also included are functions that combine model-based hierarchical cluste...

Publication date: 2007