Comparing Pure Parallel Ensemble Creation Techniques Against Bagging
Abstract
We experimentally evaluate bagging and seven other randomization-based approaches to creating an ensemble of decision-tree classifiers. Unlike methods related to boosting, all of the eight approaches create each classifier in an ensemble independently of the other classifiers in the ensemble. Bagging uses randomization to create multiple training sets. Other approaches, such as those of Dietterich, apply randomization in selecting a test at a given node of a tree. Then there are approaches, such as Breiman’s random forests and Ho’s random subspaces, which apply randomization in the selection of attributes to be used in building the tree. Experiments were performed on 28 publicly available datasets, using C4.5 release 8 as the base classifier. While each of the other seven approaches has some strengths, we find that none of them is consistently more accurate than standard bagging when tested for statistical significance.
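The two main sources of randomization named in the abstract, resampling the training set (bagging) and restricting the attributes available to a tree (random subspaces), can be illustrated with a minimal Python sketch. The function names, the unweighted majority vote, and the toy data are assumptions made for illustration only; the abstract does not specify these details, and no tree learner such as C4.5 is included here.

```python
import random
from collections import Counter


def bootstrap_sample(rows, rng):
    """Bagging-style resampling: draw len(rows) examples with replacement."""
    return [rng.choice(rows) for _ in rows]


def random_subspace(n_attributes, k, rng):
    """Random-subspace-style selection: pick k distinct attribute indices."""
    return sorted(rng.sample(range(n_attributes), k))


def majority_vote(predictions):
    """Combine the outputs of independently built classifiers by unweighted vote."""
    return Counter(predictions).most_common(1)[0][0]


if __name__ == "__main__":
    rng = random.Random(0)        # fixed seed so the sketch is repeatable
    rows = list(range(10))        # stand-in for ten training examples
    print(bootstrap_sample(rows, rng))
    print(random_subspace(8, 3, rng))
    print(majority_vote(["yes", "no", "yes"]))
```

Because each sample or attribute subset is drawn without reference to the classifiers already built, every member of the ensemble could be trained in parallel, which is the property that separates these methods from boosting.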
Similar resources
Application of ensemble learning techniques to model the atmospheric concentration of SO2
In view of pollution prediction modeling, the study adopts homogeneous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...
Comparing ensembles of decision trees and neural networks for one-day-ahead streamflow prediction
Ensemble learning methods have received remarkable attention in recent years and have led to considerable advances in performance on regression and classification problems. Bagging and boosting are among the most popular ensemble learning techniques proposed to reduce the prediction error of learning machines. In this study, bagging and gradient boosting algorithms are incorporated in...
Bagging Classifiers for Fighting Poisoning Attacks in Adversarial Classification Tasks
Pattern recognition systems have been widely used in adversarial classification tasks like spam filtering and intrusion detection in computer networks. In these applications a malicious adversary may successfully mislead a classifier by “poisoning” its training data with carefully designed attacks. Bagging is a well-known ensemble construction method, where each classifier in the ensemble is tr...
Genetic Approach to Feature Selection for Ensemble
Ensembles of classifiers have been shown to be very effective for case-based classification tasks. The vast majority of ensemble construction algorithms use the complete set of features available in the problem domain for the ensemble creation. Recent work on randomly selected subspaces for ensemble construction has been shown to improve the accuracy of the ensemble considerably. In this paper we ...
An Empirical Investigation on the Use of Diversity for Creation of Classifier Ensembles
We address one of the main open issues about the use of diversity in multiple classifier systems: the effectiveness of the explicit use of diversity measures for creation of classifier ensembles. So far, diversity measures have been mostly used for ensemble pruning, namely, for selecting a subset of classifiers out of an original, larger ensemble. Here we focus on pruning techniques based on fo...