نتایج جستجو برای: imbalanced data sampling

تعداد نتایج: 2528204  

Journal: :Advances in Science, Technology and Engineering Systems Journal 2017

2013
Peng Cao Dazhe Zhao

Class imbalance is one of the challenging problems for machine learning in many real-world applications. Many methods have been proposed to address and attempt to solve the problem, including sampling and cost-sensitive learning. The latter has attracted significant attention in recent years to solve the problem, but it is difficult to determine the precise misclassification costs in practice. ...

Journal: :CoRR 2017
Roghayeh Soleymani Eric Granger Giorgio Fumera

In practice, pattern recognition applications often suffer from imbalanced data distributions between classes, which may vary during operations w.r.t. the design data. Two-class classification systems designed using imbalanced data tend to recognize the majority (negative) class better, while the class of interest (positive class) often has the smaller number of samples. Several data-level tech...

پایان نامه :دانشگاه آزاد اسلامی - دانشگاه آزاد اسلامی واحد مرودشت - دانشکده علوم تربیتی و روانشناسی 1392

the purpose of this study was to investigate the relationship between family functioning and marital adjustment humor couples are due to the nature and objectives of the research and application of methods for its implementation correlation was used. the study population consisted of all the couples in the city who uses random cluster sampling of 200 students were selected as sample. data from ...

2013
M. Mostafizur Rahman D. N. Davis

Most medical datasets are not balanced in their class labels. Indeed in some cases it has been no ticed that the given class labels do not accurately represent characteristics of the data record. Most existing classification methods tend not to perform well on minority class examples when the dataset is extremely imbalanced. This is because they aim to optimize the overall accuracy without cons...

2012
Ahmad Abu Shanab Taghi M. Khoshgoftaar Randall Wald

Gene selection has become a vital component in the learning process when using high-dimensional gene expression data. Although extensive research has been done towards evaluating the performance of classifiers trained with the selected features, the stability of feature ranking techniques has received relatively little study. This work evaluates the robustness of eleven threshold-based feature ...

2012
Nele Verbiest Enislay Ramentol Chris Cornelis Francisco Herrera

In this paper, we present a prototype selection technique for imbalanced data, Fuzzy Rough Imbalanced Prototype Selection (FRIPS), to improve the quality of the artificial instances generated by the Synthetic Minority Over-sampling TEchnique (SMOTE). Using fuzzy rough set theory, the noise level of each instance is measured, and instances for which the noise level exceeds a certain threshold le...

2010
Piyasak Jeatrakul Kevin Kok Wai Wong Lance Chun Che Fung

In classification, when the distribution of the training data among classes is uneven, the learning algorithm is generally dominated by the feature of the majority classes. The features in the minority classes are normally difficult to be fully recognized. In this paper, a method is proposed to enhance the classification accuracy for the minority classes. The proposed method combines Synthetic ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید