Hierarchical Clustering Using One-Class Support Vector Machines

نویسنده

  • Gyemin Lee
چکیده

This paper presents a novel hierarchical clustering method using support vector machines. A common approach for hierarchical clustering is to use distance for the task. However, different choices for computing inter-cluster distances often lead to fairly distinct clustering outcomes, causing interpretation difficulties in practice. In this paper, we propose to use a one-class support vector machine (OC-SVM) to directly find high-density regions of data. Our algorithm generates nested set estimates using the OC-SVM and exploits the hierarchical structure of the estimated sets. We demonstrate the proposed algorithm on synthetic datasets. The cluster hierarchy is visualized with dendrograms and spanning trees.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fault diagnosis in a distillation column using a support vector machine based classifier

Fault diagnosis has always been an essential aspect of control system design. This is necessary due to the growing demand for increased performance and safety of industrial systems is discussed. Support vector machine classifier is a new technique based on statistical learning theory and is designed to reduce structural bias. Support vector machine classification in many applications in v...

متن کامل

Mining Biological Repetitive Sequences Using Support Vector Machines and Fuzzy SVM

Structural repetitive subsequences are most important portion of biological sequences, which play crucial roles on corresponding sequence’s fold and functionality. Biggest class of the repetitive subsequences is “Transposable Elements” which has its own sub-classes upon contexts’ structures. Many researches have been performed to criticality determine the structure and function of repetitiv...

متن کامل

Conversational Topic Classification using SVMs and Features Induced by Clustering

This work explores the use of Support Vector Machines (SVM) for topic classification of conversations. An All-vs-One SVM system is used as the baseline. Several methods in feature weight scaling and feature selection are compared. Results suggest that the conversation domain requires a different set of methods from the written text domain. Finally, a feature selection method based on hierarchic...

متن کامل

Density Clustering Based SVM and Its Application to Polyadenylation Signals∗

Support vector machines (SVM) have been promising methods for classification analysis due to their solid mathematical foundations. Clustering-based SVMs are used to solve large samples classification problems and reduce the computational cost. In this paper, we present a density clustering based SVM(DCB-SVM) method to predict polyadenylation signal (PAS) in human DNA and mRNA sequences. We decr...

متن کامل

Protection or Privacy? Data Mining and Personal Data

A multiclass classification method based on output design p. 15 Regularized semi-supervised classification on manifold p. 20 Similarity-based sparse feature extraction using local manifold learning p. 30 Generalized conditional entropy and a metric splitting criterion for decision trees p. 35 RNBL-MN : a recursive naive Bayes learner for sequence classification p. 45 TRIPPER : rule learning usi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Symmetry

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2015