Efficient variable selection in support vector machines via the alternating direction method of multipliers
Authors
Abstract
The support vector machine (SVM) is a widely used tool for classification. Although commonly understood as a method of finding the maximum-margin hyperplane, it can also be formulated as a regularized function estimation problem, corresponding to a hinge loss function plus an l2-norm regularization term. The doubly regularized support vector machine (DrSVM) is a variant of the standard SVM, which introduces an additional l1-norm regularization term on the fitted coefficients. The combined l1 and l2 regularization, termed the elastic net penalty, has the property of achieving simultaneous variable selection and margin maximization within a single framework. However, because of the nondifferentiability of both the loss function and the regularization term, there is no efficient method available to solve DrSVM for large-scale problems. Here we develop an efficient algorithm based on the alternating direction method of multipliers (ADMM) to solve the optimization problem in DrSVM. The utility of the method is illustrated using both simulated and real-world data.
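The DrSVM objective combines the hinge loss with the elastic net penalty, and ADMM handles the two nondifferentiable terms by splitting them into separate subproblems with closed-form proximal updates. The sketch below illustrates one common way to set up such a splitting; the specific splitting, variable names (`drsvm_admm`, `lam1`, `lam2`, `rho`), and stopping rule are illustrative assumptions, not the paper's exact algorithm.

```python
import numpy as np

def drsvm_admm(X, y, lam1=0.5, lam2=0.5, rho=1.0, n_iter=300):
    """Illustrative ADMM sketch for the doubly regularized SVM:

        minimize  sum_i max(0, 1 - y_i (x_i^T w + b))
                  + lam1 * ||w||_1 + (lam2 / 2) * ||w||_2^2

    Splitting (one common choice, assumed here):
        a = diag(y) (X w + b 1)   isolates the hinge loss
        z = w                     isolates the elastic net penalty
    """
    n, p = X.shape
    A = np.hstack([y[:, None] * X, y[:, None]])  # maps (w, b) -> diag(y)(Xw + b1)
    E = np.eye(p + 1)
    E[-1, -1] = 0.0                              # z-constraint touches w only, not b
    M_inv = np.linalg.inv(A.T @ A + E)           # fixed system: factor once

    a = np.zeros(n); z = np.zeros(p)
    ua = np.zeros(n); uz = np.zeros(p)           # scaled dual variables
    for _ in range(n_iter):
        # (w, b)-update: least squares against both split variables
        rhs = A.T @ (a - ua) + np.concatenate([z - uz, [0.0]])
        wb = M_inv @ rhs
        w = wb[:p]
        # a-update: elementwise prox of the hinge loss, step t = 1/rho
        v = A @ wb + ua
        t = 1.0 / rho
        a = np.where(v >= 1.0, v, np.where(v <= 1.0 - t, v + t, 1.0))
        # z-update: prox of the elastic net (soft-threshold, then shrink)
        q = w + uz
        z = np.sign(q) * np.maximum(np.abs(q) - lam1 / rho, 0.0) / (1.0 + lam2 / rho)
        # dual ascent on both constraints
        ua += A @ wb - a
        uz += w - z
    return z, wb[p]  # z carries the sparsified coefficients; wb[p] is the intercept
```

Each ADMM step is cheap: the quadratic (w, b)-update reuses one matrix factorization, and both proximal updates are elementwise, which is what makes this approach attractive for large-scale problems despite the two nonsmooth terms.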
Similar articles
Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers
Many problems of recent interest in statistics and machine learning can be posed in the framework of convex optimization. Due to the explosion in size and complexity of modern datasets, it is increasingly important to be able to solve problems with a very large number of features or training examples. As a result, both the decentralized collection or storage of these datasets as well as accompa...
Modified Convex Data Clustering Algorithm Based on Alternating Direction Method of Multipliers
Given that the main weakness of most standard methods, including k-means and hierarchical data clustering, is their sensitivity to initialization and their tendency to become trapped in local minima, this paper proposes a modification of convex data clustering that removes the need to be careful about how initial values are selected. By properly converting the task of optimization to an equivalent...
Alternating Direction Methods for Latent Variable Gaussian Graphical Model Selection
Chandrasekaran, Parrilo, and Willsky (2012) proposed a convex optimization problem for graphical model selection in the presence of unobserved variables. This convex optimization problem aims to estimate an inverse covariance matrix that can be decomposed into a sparse matrix minus a low-rank matrix from sample data. Solving this convex optimization problem is very challenging, especially for l...
An iterative SVM approach to feature selection and classification in high-dimensional datasets
The support vector machine (SVM) is a state-of-the-art classification method, and the doubly regularized SVM (DrSVM) is an important extension of it based on the elastic net penalty. DrSVM has been successfully applied to variable selection while retaining (or discarding) correlated variables as groups. However, this model is challenging to solve. In this paper we develop an iterative l2-SVM...
Efficient Distributed Linear Classification Algorithms via the Alternating Direction Method of Multipliers
Linear classification has demonstrated success in many areas of applications. Modern algorithms for linear classification can train reasonably good models while going through the data in only tens of rounds. However, large data often does not fit in the memory of a single machine, which makes the bottleneck in large-scale learning the disk I/O, not the CPU. Following this observation, Yu et al....