Peculiar Genes Selection: A new features selection method to improve classification performances in imbalanced data sets

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Peculiar Genes Selection: A new features selection method to improve classification performances in imbalanced data sets

High-Throughput technologies provide genomic and trascriptomic data that are suitable for biomarker detection for classification purposes. However, the high dimension of the output of such technologies and the characteristics of the data sets analysed represent an issue for the classification task. Here we present a new feature selection method based on three steps to detect class-specific biom...

متن کامل

A Novel One Sided Feature Selection Method for Imbalanced Text Classification

The imbalance data can be seen in various areas such as text classification, credit card fraud detection, risk management, web page classification, image classification, medical diagnosis/monitoring, and biological data analysis. The classification algorithms have more tendencies to the large class and might even deal with the minority class data as the outlier data. The text data is one of t...

متن کامل

A Feature Selection Method to Handle Imbalanced Data in Text Classification

Imbalanced data problem is often encountered in application of text classification. Feature selection, which could reduce the dimensionality of feature space and improve the performance of the classifier, is widely used in text classification. This paper presents a new feature selection method named NFS, which selects class information words rather than terms with high document frequency. To im...

متن کامل

Hierarchical fuzzy rule based classification systems with genetic rule selection for imbalanced data-sets

In many real application areas, the data used are highly skewed and the number of instances for some classes are much higher than that of the other classes. Solving a classification task using such an imbalanced data-set is difficult due to the bias of the training towards the majority classes. The aim of this paper is to improve the performance of fuzzy rule based classification systems on imb...

متن کامل

Using Classification Techniques to Improve Replica Selection in Data Grid

Data grid is developed to facilitate sharing data and resources located in different parts of the world. The major barrier to support fast data access in a data grid is the high latency of wide area networks and the Internet. Data replication is adopted to improve data access performance. When different sites hold replicas, there are significant benefits while selecting the best replica. In thi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: PLOS ONE

سال: 2017

ISSN: 1932-6203

DOI: 10.1371/journal.pone.0177475