Classification of Imbalanced Data Represented as Binary Features
نویسندگان
چکیده
Typically, classification is conducted on a dataset that consists of numerical features and target classes. For instance, grayscale image, which usually represented as matrix integers varying from 0 to 255, enables one apply various algorithms image tasks. However, datasets binary cannot use many standard machine learning optimally, yet their amount not negligible. On the other hand, oversampling such synthetic minority technique (SMOTE) its variants are often used if for imbalanced. since SMOTE synthesize new samples based original samples, diversity synthesized highly limited due poor representation features. To solve this problem, preprocessing approach studied. By converting into ones using feature extraction methods, succeeding methods can fully display potential in improving classifiers’ performances. Through comprehensive experiments benchmark real medical datasets, it was observed converted consisting better (maximum improvements accuracy F1-score were 35.11% 42.17%, respectively). In addition, confirmed synergistically contribute improvement performance.
منابع مشابه
On Mining Fuzzy Classification Rules for Imbalanced Data
Fuzzy rule-based classification system (FRBCS) is a popular machine learning technique for classification purposes. One of the major issues when applying it on imbalanced data sets is its biased to the majority class, such that, it performs poorly in respect to the minority class. However many cases the minority classes are more important than the majority ones. In this paper, we have extended ...
متن کاملOn Mining Fuzzy Classification Rules for Imbalanced Data
Fuzzy rule-based classification system (FRBCS) is a popular machine learning technique for classification purposes. One of the major issues when applying it on imbalanced data sets is its biased to the majority class, such that, it performs poorly in respect to the minority class. However many cases the minority classes are more important than the majority ones. In this paper, we have extended ...
متن کاملMine Classification with Imbalanced Data
In binary classification problems it is common for the two classes to be imbalanced: one case is very rare compared to the other. Traditional classification approaches usually ignore this class imbalance, causing performance to suffer accordingly. In contrast, the algorithm infinitely imbalanced logistic regression (IILR) algorithm explicitly addresses class imbalance in its formulation. This p...
متن کاملExtract minimum positive and maximum negative features for imbalanced binary classification
In an imbalanced dataset, the positive and negative classes can be quite different in both size and distribution. This degrades the performance of many feature extraction methods and classifiers. This paper proposes a method for extracting minimum positive and maximum negative features (in terms of absolute value) for imbalanced binary classification. This paper develops two models to yield the...
متن کاملon mining fuzzy classification rules for imbalanced data
fuzzy rule-based classification system (frbcs) is a popular machine learning technique for classification purposes. one of the major issues when applying it on imbalanced data sets is its biased to the majority class, such that, it performs poorly in respect to the minority class. however many cases the minority classes are more important than the majority ones. in this paper, we have extended ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2021
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app11177825