نتایج جستجو برای: imbalanced data sampling

تعداد نتایج: 2528204  

Journal: :IAES International Journal of Artificial Intelligence 2022

<span lang="EN-US">Data imbalance is one of the problems in application machine learning and data mining. Often this occurs most essential needed case entities. Two approaches to overcome problem are level approach algorithm approach. This study aims get best model using pap smear dataset that combined levels with an algorithmic solve imbalanced. The laboratory mostly have few imbalance. ...

<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...

2013
Pengyi Yang Wei Liu Bing Bing Zhou Sanjay Chawla Albert Y. Zomaya

The wrapper feature selection approach is useful in identifying informative feature subsets from high-dimensional datasets. Typically, an inductive algorithm “wrapped” in a search algorithm is used to evaluate the merit of the selected features. However, significant bias may be introduced when dealing with highly imbalanced dataset. That is, the selected features may favour one class while bein...

2015
Maira Anis Mohsin Ali

Credit card fraud detection along with its inherent property of class imbalance is one of the major challenges faced by the financial institutions. Many classifiers are used for the fraud detection of imbalanced data. Imbalanced data withhold the performance of classifiers by setting up the overall accuracy as a performance measure. This makes the decision to be biased towards the majority clas...

Journal: :Appl. Soft Comput. 2009
Salvador García Alberto Fernández Francisco Herrera

Classification in imbalanced domains is a recent challenge in data mining. We refer to imbalanced classification when data presents many examples from one class and few from the other class, and the less representative class is the one which has more interest from the point of view of the learning task. One of the most used techniques to tackle this problem consists in preprocessing the data pr...

Journal: :Journal of Machine Learning Research 2015
Arash Pourhabib Bani K. Mallick Yu Ding

We propose an algorithm for two-class classification problems when the training data are imbalanced. This means the number of training instances in one of the classes is so low that the conventional classification algorithms become ineffective in detecting the minority class. We present a modification of the kernel Fisher discriminant analysis such that the imbalanced nature of the problem is e...

Journal: :Pattern Recognition 2014
Yuan-Hai Shao Wei-Jie Chen Jing-Jing Zhang Zhen Wang Nai-Yang Deng

In this paper, we propose an efficient weighted Lagrangian twin support vector machine (WLTSVM) for the imbalanced data classification based on using different training points for constructing the two proximal hyperplanes. The main contributions of our WLTSVM are: (1) a graph based under-sampling strategy is introduced to keep the proximity information, which is robustness to outliers, (2) the ...

2009
Thomas Debray Evgueni N. Smirnov Georgi Nalbantov Evgueni Smirnov

In this thesis we study the classification task in the presence of class imbalanced data. This task arises in many applications when we are interested in the under-represented (minority) classes. Examples of such applications are related to fraud detection, medical diagnosis and monitoring, text categorization, risk management, information retrieval and filtering. Although there exist many stan...

2016
Quan Do Thanh Pham Wei Liu Kotagiri Ramamohanarao

Learning from imbalanced and sparse data in multi-mode and high-dimensional tensor formats efficiently is a significant problem in data mining research. On one hand, Coupled Tensor Factorization (CTF) has become one of the most popular methods for joint analysis of heterogeneous sparse data generated from different sources. On the other hand, techniques such as sampling, cost-sensitive learning...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید