Feature selection using self-information and entropy-based uncertainty measure for fuzzy neighborhood rough set

نویسندگان

چکیده

Abstract Feature selection based on the fuzzy neighborhood rough set model (FNRS) is highly popular in data mining. However, dependent function of FNRS only considers information present lower approximation decision while ignoring upper decision. This construction method may lead to loss some information. To solve this problem, paper proposes a joint entropy self-information measure (FNSIJE) and applies it feature selection. First, construct four uncertain measures variables, concept introduced into approximations from algebra view. The relationships between these their properties are discussed detail. It found that fourth measure, named tolerance self-information, has better classification performance. Second, an uncertainty been proposed Inspired by both views, FNSIJE proposed. Third, K–S test used delete features with weak distinguishing performance, which reduces dimensionality high-dimensional gene datasets, thereby reducing complexity then, forward algorithm provided. Experimental results show compared related methods, presented can select less important have higher accuracy.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neighborhood rough set based heterogeneous feature subset selection

Feature subset selection is viewed as an important preprocessing step for pattern recognition, machine learning and data mining. Most of researches are focused on dealing with homogeneous feature selection, namely, numerical or categorical features. In this paper, we introduce a neighborhood rough set model to deal with the problem of heterogeneous feature subset selection. As the classical rou...

متن کامل

Feature subset selection based on fuzzy neighborhood rough sets

Rough set theory has been extensively discussed in machine learning and pattern recognition. It provides us another important theoretical tool for feature selection. In this paper, we construct a novel rough set model for feature subset selection. First, we define the fuzzy decision of a sample by using the concept of fuzzy neighborhood. A parameterized fuzzy relation is introduced to character...

متن کامل

A New Hybrid Framework for Filter based Feature Selection using Information Gain and Symmetric Uncertainty (TECHNICAL NOTE)

Feature selection is a pre-processing technique used for eliminating the irrelevant and redundant features which results in enhancing the performance of the classifiers. When a dataset contains more irrelevant and redundant features, it fails to increase the accuracy and also reduces the performance of the classifiers. To avoid them, this paper presents a new hybrid feature selection method usi...

متن کامل

Comparing Fuzzy-Rough and Fuzzy Entropy-assisted Fuzzy-Rough Feature Selection

Feature Selection (FS) methods based on fuzzy-rough set theory (FRFS) have employed the dependency function to guide the FS process with much success. More recently a method has been developed which uses fuzzy-entropy [9] to perform this task. Such use of fuzzy-entropy as an evaluation measure in fuzzy-rough feature selection can result in smaller subset sizes than those obtained through FRFS a...

متن کامل

Application of Fuzzy-rough Set Theory for Feature Subset Selection

Fuzzy Set Theory and Rough Set Theory are the most popular mathematical tools for dealing with uncertainties. During past decades, these set theories are being applied successfully in several areas for solving many complex tasks. This paper is concerned with the application of hybrid Fuzzy-Rough set based approach for feature subset selection. Keywords— Fuzzy set theory, Rough Set theory, Fuzzy...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Complex & Intelligent Systems

سال: 2021

ISSN: ['2198-6053', '2199-4536']

DOI: https://doi.org/10.1007/s40747-021-00356-3