Convex covariate clustering for classification
نویسندگان
چکیده
Clustering, like covariate selection for classification, is an important step to compress and interpret the data. However, clustering of covariates often performed independently classification step, which can lead undesirable results that harm interpretability compression rate. Therefore, we propose a method cluster while taking into account class label information samples. We formulate problem as convex optimization uses both, a-priori similarity between covariates, from class-labeled Like ordinary [1], proposed offers unique global minima making it insensitive initialization. In order solve problem, specialized alternating direction multipliers (ADMM), scales up several thousands variables. Furthermore, in circumvent computationally expensive cross-validation, model criterion based on approximating marginal likelihood. Experiments synthetic real data confirm usefulness criterion.
منابع مشابه
Covariate Assisted Spectral Clustering
Biological and social systems consist of myriad interacting units. The interactions can be represented in the form of a graph or network. Measurements of these graphs can reveal the underlying structure of these interactions, which provides insight into the systems that generated the graphs. Moreover, in applications such as connectomics, social networks, and genomics, graph data are accompanie...
متن کاملJoint covariate selection for grouped classification
We address the problem of recovering a common set of covariates that are relevant simultaneously to several classification problems. We propose a joint measure of complexity for the group of problems that couples covariate selection. By penalizing the sum of `2-norms of the blocks of coefficients associated with each covariate across different classification problems, we encourage similar spars...
متن کاملSplitting Methods for Convex Clustering.
Clustering is a fundamental problem in many scientific applications. Standard methods such as k-means, Gaussian mixture models, and hierarchical clustering, however, are beset by local minima, which are sometimes drastically suboptimal. Recently introduced convex relaxations of k-means and hierarchical clustering shrink cluster centroids toward one another and ensure a unique global minimizer. ...
متن کاملSparse Convex Clustering
Convex clustering, a convex relaxation of k-means clustering and hierarchical clustering, has drawn recent attentions since it nicely addresses the instability issue of traditional nonconvex clustering methods. Although its computational and statistical properties have been recently studied, the performance of convex clustering has not yet been investigated in the high-dimensional clustering sc...
متن کاملOptimal Feature Selection for Data Classification and Clustering: Techniques and Guidelines
In this paper, principles and existing feature selection methods for classifying and clustering data be introduced. To that end, categorizing frameworks for finding selected subsets, namely, search-based and non-search based procedures as well as evaluation criteria and data mining tasks are discussed. In the following, a platform is developed as an intermediate step toward developing an intell...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Pattern Recognition Letters
سال: 2021
ISSN: ['1872-7344', '0167-8655']
DOI: https://doi.org/10.1016/j.patrec.2021.08.012