Balancing Strategies and Class Overlapping

نویسندگان

  • Gustavo E. A. P. A. Batista
  • Ronaldo C. Prati
  • Maria Carolina Monard
چکیده

Several studies have pointed out that class imbalance is a bottleneck in the performance achieved by standard supervised learning systems. However, a complete understanding of how this problem affects the performance of learning is still lacking. In previous work we identified that performance degradation is not solely caused by class imbalances, but is also related to the degree of class overlapping. In this work, we conduct our research a step further by investigating sampling strategies which aim to balance the training set. Our results show that these sampling strategies usually lead to a performance improvement for highly imbalanced data sets having highly overlapped classes. In addition, oversampling methods seem to outperform under-sampling methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Intersectoral Planning for Public Health: Dilemmas and Challenges

Background Intersectoral action is often presented as essential in the promotion of population health and health equity. In Norway, national public health policies are based on the Health in All Policies (HiAP) approach that promotes whole-of-government responsibility. As part of the promotion of this intersectoral responsibility, p...

متن کامل

Reducing Health Inequities Through Intersectoral Action: Balancing Equity in Health With Equity for Other Social Goods

Significant attention has been devoted to developing intersectoral strategies to reduce health inequities; however, these strategies have largely neglected to consider how equity in health ought to be weighted and balanced with the pursuit of equity for other social goods (eg, education equity). Research in this domain is crucial, as the health sector’s pursuit of health equity may be at odds w...

متن کامل

Assembly line balancing problem with skilled and unskilled workers: The advantages of considering multi-manned workstations

This paper address a special class of generalized assembly line balancing in which it is assumed that there are two groups of workers: skilled and unskilled ones. The skilled workers are hired permanently while the unskilled ones can be hired temporarily in order to meet the seasonal demands. It is also assumed that more than one worker may be assigned to each workstation. To show the adv...

متن کامل

Numerical Implementation of Overlapping Balancing Domain Decomposition Methods on Unstructured Meshes

The Overlapping Balancing Domain Decomposition (OBDD) methods can be considered as an extension of the Balancing Domain Decomposition (BDD) methods to the case of overlapping subdomains. This new approach, has been proposed and studied in [4, 3]. In this paper, we will discuss its practical parallel implementation and present numerical experiments on large unstructured meshes.

متن کامل

Restricted overlapping balancing domain decomposition methods and restricted coarse problems for the Helmholtz problem

Overlapping balancing domain decomposition methods and their combination with restricted additive Schwarz methods are proposed for the Helmholtz equation. These new methods also extend previous work on non-overlapping balancing domain decomposition methods toward simplifying their coarse problems and local solvers. They also extend restricted Schwarz methods, originally designed to overlapping ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005