Mixed Deep Gaussian Mixture Model: a clustering model for mixed datasets
نویسندگان
چکیده
Clustering mixed data presents numerous challenges inherent to the very heterogeneous nature of variables. A clustering algorithm should be able, despite this heterogeneity, extract discriminant pieces information from variables in order design groups. In work we introduce a multilayer architecture model-based method called Mixed Deep Gaussian Mixture Model that can viewed as an automatic way merge performed separately on continuous and non-continuous data. This is flexible adapted well or sense generalize Generalized Linear Latent Variable Models Models. We also new initialisation strategy driven selects best specification model optimal number clusters for given dataset. Besides, our provides low-dimensional representations which useful tool visualize datasets. Finally, validate performance approach comparing its results with state-of-the-art models over several commonly used
منابع مشابه
Mixture model clustering for mixed data with missing information
One di-culty with classi.cation studies is unobserved or missing observations that often occur in multivariate datasets. The mixture likelihood approach to clustering has been well developed and is much used, particularly for mixtures where the component distributions are multivariate normal. It is shown that this approach can be extended to analyse data with mixed categorical and continuous at...
متن کاملModel-based clustering of Gaussian copulas for mixed data
Clustering task of mixed data is a challenging problem. In a probabilistic framework, the main difficulty is due to a shortage of conventional distributions for such data. In this paper, we propose to achieve the mixed data clustering with a Gaussian copula mixture model, since copulas, and in particular the Gaussian ones, are powerful tools for easily modelling the distribution of multivariate...
متن کاملMixture model of Gaussian copulas to cluster mixed-type data
A mixture model of Gaussian copulas is proposed to cluster mixed data. This approach allows to straightforwardly define simple multivariate intra-class dependency models while preserving classical distributions for the one-dimensional margins of each component in order to facilitate the model interpretation. Moreover, the intra-class dependencies are taken into account by the Gaussian copulas w...
متن کاملDeep Autoencoding Gaussian Mixture Model
Unsupervised anomaly detection on multior high-dimensional data is of great importance in both fundamental machine learning research and industrial applications, for which density estimation lies at the core. Although previous approaches based on dimensionality reduction followed by density estimation have made fruitful progress, they mainly suffer from decoupled model learning with inconsisten...
متن کاملDeep Autoencoding Gaussian Mixture Model
Unsupervised anomaly detection on multior high-dimensional data is of great importance in both fundamental machine learning research and industrial applications, for which density estimation lies at the core. Although previous approaches based on dimensionality reduction followed by density estimation have made fruitful progress, they mainly suffer from decoupled model learning with inconsisten...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Advances in data analysis and classification
سال: 2021
ISSN: ['1862-5355', '1862-5347']
DOI: https://doi.org/10.1007/s11634-021-00466-3