Mixed Deep Gaussian Mixture Model: a clustering model for mixed datasets

نویسندگان

چکیده

Clustering mixed data presents numerous challenges inherent to the very heterogeneous nature of variables. A clustering algorithm should be able, despite this heterogeneity, extract discriminant pieces information from variables in order design groups. In work we introduce a multilayer architecture model-based method called Mixed Deep Gaussian Mixture Model that can viewed as an automatic way merge performed separately on continuous and non-continuous data. This is flexible adapted well or sense generalize Generalized Linear Latent Variable Models Models. We also new initialisation strategy driven selects best specification model optimal number clusters for given dataset. Besides, our provides low-dimensional representations which useful tool visualize datasets. Finally, validate performance approach comparing its results with state-of-the-art models over several commonly used

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mixture model clustering for mixed data with missing information

One di-culty with classi.cation studies is unobserved or missing observations that often occur in multivariate datasets. The mixture likelihood approach to clustering has been well developed and is much used, particularly for mixtures where the component distributions are multivariate normal. It is shown that this approach can be extended to analyse data with mixed categorical and continuous at...

متن کامل

Model-based clustering of Gaussian copulas for mixed data

Clustering task of mixed data is a challenging problem. In a probabilistic framework, the main difficulty is due to a shortage of conventional distributions for such data. In this paper, we propose to achieve the mixed data clustering with a Gaussian copula mixture model, since copulas, and in particular the Gaussian ones, are powerful tools for easily modelling the distribution of multivariate...

متن کامل

Mixture model of Gaussian copulas to cluster mixed-type data

A mixture model of Gaussian copulas is proposed to cluster mixed data. This approach allows to straightforwardly define simple multivariate intra-class dependency models while preserving classical distributions for the one-dimensional margins of each component in order to facilitate the model interpretation. Moreover, the intra-class dependencies are taken into account by the Gaussian copulas w...

متن کامل

Deep Autoencoding Gaussian Mixture Model

Unsupervised anomaly detection on multior high-dimensional data is of great importance in both fundamental machine learning research and industrial applications, for which density estimation lies at the core. Although previous approaches based on dimensionality reduction followed by density estimation have made fruitful progress, they mainly suffer from decoupled model learning with inconsisten...

متن کامل

Deep Autoencoding Gaussian Mixture Model

Unsupervised anomaly detection on multior high-dimensional data is of great importance in both fundamental machine learning research and industrial applications, for which density estimation lies at the core. Although previous approaches based on dimensionality reduction followed by density estimation have made fruitful progress, they mainly suffer from decoupled model learning with inconsisten...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Advances in data analysis and classification

سال: 2021

ISSN: ['1862-5355', '1862-5347']

DOI: https://doi.org/10.1007/s11634-021-00466-3