Fast SFFS-Based Algorithm for Feature Selection in Biomedical Datasets

نویسندگان

  • F. Shirbani 1-MSc. Student, Control and Intelligent Processing Center of Excellence (CIPCE), Electrical and Computer Engineering Department, University of Tehran, Tehran, Iran
  • H. Soltanian Zadeh Professor, Department of Diagnostic Radiology, Henry Ford Hospital, Detroit, MI, USA 2- Professor, School of Cognitive Sciences (SCS), Institute for Research in Fundamental Sciences (IPM), Tehran, Iran
چکیده مقاله:

Biomedical datasets usually include a large number of features relative to the number of samples. However, some data dimensions may be less relevant or even irrelevant to the output class. Selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. To this end, this paper presents a hybrid method of filter and wrapper feature selection that takes advantage of a modified method of sequential forward floating search (SFFS) algorithm. The filtering approach evaluates the features for predicting the output and complementing the other features. The candidate subset generated by the filtering approach is used by k-fold cross validation of support vector machine (SVM) with user-defined classification margin as a wrapper. Applications of the proposed SFFS method to five biomedical datasets illustrate its superiority in terms of classification accuracy and execution time relative to the conventional SFFS method and another previously improved SFFS method.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

fast sffs-based algorithm for feature selection in biomedical datasets

biomedical datasets usually include a large number of features relative to the number of samples. however, some data dimensions may be less relevant or even irrelevant to the output class. selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. to this end, this paper presents a hybrid method of filter and wr...

متن کامل

Feature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets

Objective(s): This study addresses feature selection for breast cancer diagnosis. The present process uses a wrapper approach using GA-based on feature selection and PS-classifier. The results of experiment show that the proposed model is comparable to the other models on Wisconsin breast cancer datasets. Materials and Methods: To evaluate effectiveness of proposed feature selection method, we ...

متن کامل

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

Measuring Stability of Feature Selection in Biomedical Datasets

An important step in the analysis of high-dimensional biomedical data is feature selection. Typically, a feature subset selected by a feature selection method is evaluated for relevance towards a task such as prediction or classification. Another important property of a feature selection method is stability that refers to robustness of the selected features to perturbations in the data. In biom...

متن کامل

feature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets

objective(s): this study addresses feature selection for breast cancer diagnosis. the present process uses a wrapper approach using ga-based on feature selection and ps-classifier. the results of experiment show that the proposed model is comparable to the other models on wisconsin breast cancer datasets. materials and methods: to evaluate effectiveness of proposed feature selection method, we ...

متن کامل

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}


عنوان ژورنال

دوره 45  شماره 2

صفحات  43- 56

تاریخ انتشار 2013-11-01

با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023