Convex covariate clustering for classification

نویسندگان

چکیده

Clustering, like covariate selection for classification, is an important step to compress and interpret the data. However, clustering of covariates often performed independently classification step, which can lead undesirable results that harm interpretability compression rate. Therefore, we propose a method cluster while taking into account class label information samples. We formulate problem as convex optimization uses both, a-priori similarity between covariates, from class-labeled Like ordinary [1], proposed offers unique global minima making it insensitive initialization. In order solve problem, specialized alternating direction multipliers (ADMM), scales up several thousands variables. Furthermore, in circumvent computationally expensive cross-validation, model criterion based on approximating marginal likelihood. Experiments synthetic real data confirm usefulness criterion.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Covariate Assisted Spectral Clustering

Biological and social systems consist of myriad interacting units. The interactions can be represented in the form of a graph or network. Measurements of these graphs can reveal the underlying structure of these interactions, which provides insight into the systems that generated the graphs. Moreover, in applications such as connectomics, social networks, and genomics, graph data are accompanie...

متن کامل

Joint covariate selection for grouped classification

We address the problem of recovering a common set of covariates that are relevant simultaneously to several classification problems. We propose a joint measure of complexity for the group of problems that couples covariate selection. By penalizing the sum of `2-norms of the blocks of coefficients associated with each covariate across different classification problems, we encourage similar spars...

متن کامل

Splitting Methods for Convex Clustering.

Clustering is a fundamental problem in many scientific applications. Standard methods such as k-means, Gaussian mixture models, and hierarchical clustering, however, are beset by local minima, which are sometimes drastically suboptimal. Recently introduced convex relaxations of k-means and hierarchical clustering shrink cluster centroids toward one another and ensure a unique global minimizer. ...

متن کامل

Sparse Convex Clustering

Convex clustering, a convex relaxation of k-means clustering and hierarchical clustering, has drawn recent attentions since it nicely addresses the instability issue of traditional nonconvex clustering methods. Although its computational and statistical properties have been recently studied, the performance of convex clustering has not yet been investigated in the high-dimensional clustering sc...

متن کامل

Optimal Feature Selection for Data Classification and Clustering: Techniques and Guidelines

In this paper, principles and existing feature selection methods for classifying and clustering data be introduced. To that end, categorizing frameworks for finding selected subsets, namely, search-based and non-search based procedures as well as evaluation criteria and data mining tasks are discussed. In the following, a platform is developed as an intermediate step toward developing an intell...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Pattern Recognition Letters

سال: 2021

ISSN: ['1872-7344', '0167-8655']

DOI: https://doi.org/10.1016/j.patrec.2021.08.012