On L1-norm multi-class support vector machines: methodology and theory∗
نویسندگان
چکیده
Binary Support Vector Machines have proven to deliver high performance. In multi-class classification, however, issues remain with respect to variable selection. One challenging issue is classification and variable selection in presence of a large number of variables in the magnitude of thousands, which greatly exceeds the size of training sample. This often occurs in genomics classification. To meet the challenge, this article proposes a novel multi-class support vector machine, which performs classification and variable selection simultaneously through an L1-norm penalized sparse representation. The proposed methodology, together with the developed regularization solution path, permits variable selection in such a situation. For the proposed methodology, a statistical learning theory is developed to quantify the generalization error to understand the basic structure of sparse learning, permitting the number of variables greatly exceeding the sample size. The operating characteristics of the methodology are examined via both simulated and benchmark data, and are compared against some competitors in terms of accuracy of prediction. The numerical results suggest that the proposed methodology is highly competitive. ∗ This research was supported in part by National Science Foundation Grant IIS0328802. The authors would like to thank the editor, the associate editor and three anonymous referees for helpful comments and suggestions.
منابع مشابه
Fault diagnosis in a distillation column using a support vector machine based classifier
Fault diagnosis has always been an essential aspect of control system design. This is necessary due to the growing demand for increased performance and safety of industrial systems is discussed. Support vector machine classifier is a new technique based on statistical learning theory and is designed to reduce structural bias. Support vector machine classification in many applications in v...
متن کاملMining Biological Repetitive Sequences Using Support Vector Machines and Fuzzy SVM
Structural repetitive subsequences are most important portion of biological sequences, which play crucial roles on corresponding sequence’s fold and functionality. Biggest class of the repetitive subsequences is “Transposable Elements” which has its own sub-classes upon contexts’ structures. Many researches have been performed to criticality determine the structure and function of repetitiv...
متن کاملEmotional Cellular-Based Multi- Class Fuzzy Support Vector Machines on Product’s KANSEI Extraction
It is an important methodology to extract product’s overall KANSEI images by evaluating Critical Form Features (CFF). In this paper, Multi-class Fuzzy Support Vector Machines (MF-SVM) employing Emotional Cellular (EC) model was presented to extract KANSEI images of product’s CFF. EC is a very special kind of semantics cell, which is defined on two-dimensional (Valence-Arousal) emotional space. ...
متن کاملDempster-shafer Theory Based Multi-class Support Vector Machines and Their Applications
How to extend standard support vector machines to solve multi-class classification problem and yield the outputs in the frame of Dempster-Shafer theory is useful. The multi-class probability support vector machine is proposed, firstly. The Dempster-Shafer theory based multi-class support vector machine is designed by constructing probability support vector machines for binary classification usi...
متن کاملFace Recognition using Eigenfaces , PCA and Supprot Vector Machines
This paper is based on a combination of the principal component analysis (PCA), eigenface and support vector machines. Using N-fold method and with respect to the value of N, any person’s face images are divided into two sections. As a result, vectors of training features and test features are obtain ed. Classification precision and accuracy was examined with three different types of kernel and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006