نتایج جستجو برای: imbalanced data sampling

تعداد نتایج: 2528204  

Journal: :Expert Syst. Appl. 2009
Show-Jane Yen Yue-Shi Lee

For classification problem, the training data will significantly influence the classification accuracy. However, the data in real-world applications often are imbalanced class distribution, that is, most of the data are in majority class and little data are in minority class. In this case, if all the data are used to be the training data, the classifier tends to predict that most of the incomin...

Journal: :Neurocomputing 2015
Jerzy Blaszczynski Jerzy Stefanowski

Various approaches to extend bagging ensembles for class imbalanced data are considered. First, we review known extensions and compare them in a comprehensive experimental study. The results show that integrating bagging with under-sampling is more powerful than over-sampling. They also allow to distinguish Roughly Balanced Bagging as the most accurate extension. Then, we point out that complex...

Journal: :IJKESDP 2011
Hien M. Nguyen Eric W. Cooper Katsuari Kamei

Traditional classification algorithms, in many times, perform poorly on imbalanced data sets in which some classes are heavily outnumbered by the remaining classes. For this kind of data, minority class instances, which are usually much more of interest, are often misclassified. The paper proposes a method to deal with them by changing class distribution through oversampling at the borderline b...

2014
Li Peng Yu Xiao-yang

This paper presents a SVM classification method based on cluster boundary sampling and sample pruning. We actively explore an effective solution to solve the difficult problem of imbalanced data set classification from data re-sampling and algorithm improving. Firstly, we creatively propose the method of cluster boundary sampling, using the clustering density threshold and the boundary density ...

2006
Show-Jane Yen Yue-Shi Lee

The most important factor of classification for improving classification accuracy is the training data. However, the data in real-world applications often are imbalanced class distribution, that is, most of the data are in majority class and little data are in minority class. In this case, if all the data are used to be the training data, the classifier tends to predict that most of the incomin...

Journal: :Pattern Recognition 2007
Jigang Xie Zhengding Qiu

This paper demonstrates that the imbalanced data sets have a negative effect on the performance of LDA theoretically. This theoretical analysis is confirmed by the experimental results: using several sampling methods to rebalance the imbalanced data sets, it is found that the performances of LDA on balanced data sets are superior to those of LDA on imbalanced data sets. 2006 Pattern Recognition...

Journal: :journal of advances in computer research 0

fuzzy rule-based classification system (frbcs) is a popular machine learning technique for classification purposes. one of the major issues when applying it on imbalanced data sets is its biased to the majority class, such that, it performs poorly in respect to the minority class. however many cases the minority classes are more important than the majority ones. in this paper, we have extended ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید